Abstract


The  research  program  described  in  this  report  addresses  several  generic  avenues  of 
opportunity  in  the  advancement  of  a  sophisticated  component  technology  base  for 
applications  in  optical  signal  processing,  optical  information  processing,  and  optical 
computing.  As  such,  the  research  program  is  multifaceted  as  well  as  highly 
interdisciplinary,  spanning  activities  from  materials  growth,  processing  and  characterization 
through  device  invention  and  evaluation  to  preliminary  system  level  integration. 


The  primary  program  thrusts  include: 

(1)  Identification  of  the  fundamental  limitations  inherent  in  optical  information 
processing  and  computing  systems  that  derive  from  physical  laws,  and 
discrimination  of  these  limitations  from  those  performance  boundaries  that 
result  from  device  technological  and  materials  parameter  considerations; 

(2)  An  in-depth  analysis  of  optical  signal  processing,  optical  information 
processing,  and  optical  computing,  with  consideration  of  present  levels  of 
accomplishment  and  expected  future  development; 

(3)  The  invention  and  extensive  characterization  of  a  novel  differential 
interferometric  readout  technique  for  high  contrast  ratio  parallel  readout  of 
optical  discs  employed  as  two-dimensional  spatial  light  modulators; 
(4)  A  critical  assessment  of  the  potential  for  fabrication  of  spatial  light  modulators 
and  volume  holographic  optical  elements  in  the  III-V  or  II-VI  compound 
semiconductor  materials  systems  through  the  use  of  layered,  multiple  quantum 
well,  and  superlattice  structures; 

(5)  Analysis  and  implementation  of  real  time  volume  holographic  optical  elements, 
including  a  novel  multilayered  structure  (the  so-called  stratified  volume 
holographic  optical  element,  or  SVHOE)  with  unique,  potentially  programmable 
diffraction  characteristics; 

(6)  The  development  of  advanced  techniques  for  the  utilization  of  multi-dimensional 
dynamically  programmable  interconnections  in  hybrid  optical/electronic 
multiprocessors  based  on  VLSI  and  WSI  technologies; 

(7)  The  invention,  analysis,  and  development  of  a  novel  holographic  recording  and 
readout  technique  that  allows  highly  multiplexed,  weighted  interconnections  to 
be  established  in  real  time  photorefractive  materials,  characterized  by  high 
throughput  efficiency  and  very  low  interchannel  crosstalk; 

(8)  The  practical  development  of  photorefractive  volume  holographic  optical 
elements,  including  the  incorporation  of  novel  two-layer  antireflection  coatings 
on  very  high  index  substrates  to  eliminate  the  reduction  in  diffraction  efficiency 
attributed  to  the  presence  of  multiple  reflections,  and  the  invention  of  methods 
for  the  elimination  of  a  catastrophic  collapse  of  the  internal  electric  field  in 
photorefractive  materials  under  holographic  recording  conditions; 

(9)  Optimization  of  the  growth  and  processing  of  important  electrooptic  single 
crystal  materials  such  as  bismuth  silicon  oxide,  lithium  niobate,  and  strontium 
barium  niobate; 

(10)  Development  of  novel  materials  characterization  techniques  for  dielectric  single 
crystals  and  thin  films,  such  as  the  electrooptic  measurement  of  the  volume 
resistivity  of  photorefractive  materials,  for  integration  with  the  growth  and 
processing  effort;  and 


(11)  Analysis  and  optimization  of  a  number  of  candidate  electrooptic  spatial  light 
modulator  structures  based  on  bulk  single  crystal  materials,  including  the 
Photorefractive  Incoherent-to-Coherent  Optical  Converter  (PICOC),  a  Pockels 
Readout  Optical  Modulator  (PROM)  with  no  dielectric  blocking  layers  and  a 
<111>  crystallographic  orientation,  single  channel  and  linear  array  total  internal 
reflection  spatial  light  modulators,  and  the  Optically  Modulated  Total  Internal 
Reflection  (OMTIR)  spatial  light  modulator. 
1.  PROGRAM  SUMMARY 

The  research  program  described  in  this  report  is  focused  on  a  critical  evaluation  of 
advanced  photonic  materials  and  device  concepts  for  the  implementation  of  optical 
information  processing  and  computing  systems.  The  effort  ranges  from  a  detailed 
investigation  of  the  fundamental  physical  and  technological  limitations  that  impact  the 
potential  computational  gain  (e.g.  increases  in  throughput,  decreases  in  decision  time 
subsequent  to  processing,  or  minimization  of  the  energy  expended  during  computation)  of 
optical  information  processing  and  computing  systems,  through  the  invention  and 
characterization  of  key  enabling  devices  such  as  two-dimensional  spatial  light  modulators 
and  volume  holographic  optical  elements,  to  the  development  of  advanced  techniques  for 
materials  growth,  deposition,  and  processing  that  have  a  critical  impact  on  potential  device 
performance.  As  such,  the  research  program  is  necessarily  interdisciplinary,  involving 
both  faculty  and  students  with  physics,  mathematics,  electrical  engineering,  optics, 
materials  science,  and  chemical  engineering  expertise.  This  multifaceted  evaluation  of 
novel  materials,  device,  and  system  concepts  has  been  directly  responsible  for  the  invention 
and  characterization  of  a  number  of  photonic  devices  and  materials  processing  techniques 
that  exhibit  both  high  performance  and  capacity  for  practical  manufacturing. 

The  research  period  covered  by  this  report  extends  from  15  May,  1987  through  14 
May,  1990.  In  addition  to  the  set  of  publications  attached  as  Appendices  to  this  report,  a 
significant  number  of  manuscripts  are  in  advanced  stages  of  preparation,  and  will  be 
submitted  for  publication  during  the  remaining  program  phase  and  included  in  the  Final 
Technical  Report. 

The  primary  program  thrusts  can  be  organized  into  three  principal  categories:  (1) 
fundamental  and  technological  limitations  of  optical  information  processing  and  computing; 
(2)  electrically  and  optically  addressed  spatial  light  modulators;  and  (3)  volume  holographic 
optical  elements.  The  principal  results  of  the  research  program  in  each  category  are 
outlined  below.  Further  technical  details  and  a  guide  to  the  various  publications,  as  well  as 
continuing  directions  of  research,  are  provided  in  Section  2  (Progress  During  the  Contract 
Period). 

A  detailed  study  of  the  fundamental  as  well  as  technological  constraints  that  apply  to 
an  optically  based  information  processing  and  computing  technology  has  been  undertaken. 
The  purpose  of  the  study  is  to  delineate  potential  areas  of  opportunity  that  can  be  addressed 
by  both  optical  and  photonic  techniques,  in  combination  with  electronic  technology  where 
appropriate.  A  major  result  of  the  study  thus  far  has  been  an  evaluation  of  the  tradeoffs 
inherent  in  computation  based  on  both  analog  and  binary  (digital)  representations.  Analog 
processing  is  favorable  from  an  energy  metric  perspective  whenever  the  computational 
complexity  of  the  algorithm  (or  architecture)  is  large  enough  to  overcome  an  inherently 
higher  representation  cost  for  a  given  dynamic  range.  A  related  result  is  that  currently 
available  analog  optical  devices  operate  much  closer  to  the  relevant  quantum  limits  than 
either  currently  available  binary  optical  or  electronic  devices.  These  results  are  in  the 
process  of  being  applied  to  analog  electronic  circuitry  as  well  for  direct  comparison.  In  a 
parallel  effort,  the  fundamental  limits  of  the  photorefractive  grating  recording  sensitivity 
have  been  established.  These  limits  clearly  explain  the  apparent  insensitivity  of  currently 
investigated  photorefractive  grating  recording  materials,  and  imply  several  promising 
opportunities  for  critically  needed  sensitivity  enhancement. 


Three  different  types  of  electrically  and/or  optically  addressed  spatial  light  modulators 
(SLMs)  have  been  invented,  developed,  and  characterized  during  the  course  of  the  research 
program  to  date.  These  devices  include  an  optical  disc  spatial  light  modulator,  a  multiple 
quantum  well  (MQW)  asymmetric  Fabry-Perot  spatial  light  modulator,  and  the 
Photorefractive  Incoherent-to-Coherent  Optical  Converter  (PICOC).  The  optical  disc 
spatial  light  modulator  leverages  existing  compact  disc  read-only  memory  (CD-ROM) 
technology  by  incorporating  area  encoding  techniques  and  a  differential  interferometric 
readout  configuration  to  yield  a  high  resolution  SLM  with  high  contrast  (100:1),  in  situ 
memory  capacity,  scrolling  capability,  and  near  term  insertion  prospects.  The  multiple 
quantum  well  asymmetric  Fabry-Perot  spatial  light  modulator  offers  potential  hybrid 
integrability  of  III-V  compound  semiconductor  modulation  elements  with  silicon-based 
detection  and  control  circuitry,  and  features  a  novel  inverted  cavity  design  made  possible  by 
the  incorporation  of  InGaAs/GaAs  strained  layer  multiple  quantum  wells  on  a  transparent 
GaAs  substrate.  The  Photorefractive  Incoherent-to-Coherent  Optical  Converter  is  a 
two-dimensional  spatial  light  modulator  that  exhibits  remarkable  fabrication  simplicity  by 
making  use  of  the  incoherent  (image-bearing)  erasure  of  a  coherently  recorded  grating  in  a 
photorefractive  material  such  as  bismuth  silicon  oxide  (Bi12SiO20). 

Associated  with  the  optical  disc  spatial  light  modulator  characterization  effort  has  been 
a  parallel  effort  to  develop  an  appropriate  optical  recording  media  test  facility.  To  this  end, 
we  have  participated  in  the  modification  of  a  commercially  available  optical  media  tester 
(Apex  Systems  OHMT-300)  that  is  capable  of  recording  two-dimensional  image-formatted 
patterns  on  a  wide  range  of  optical  disc  media  with  an  extremely  high  degree  of  track-to- 
track  accuracy.  The  development  and  acquisition  of  this  major  piece  of  capital  equipment 
represents  a  significant  program  achievement,  and  has  allowed  for  preliminary  tests  that 
demonstrate  the  viability  of  optical  disc  based  spatial  light  modulators. 

In  the  area  of  volume  holographic  optical  elements  and  interconnections,  significant 
progress  has  been  achieved  on  three  separate  issues:  (1)  the  development  of  a  viable 
approach  to  highly  multiplexed  weighted  interconnections  based  on  holographic  techniques; 
(2)  the  characterization  and  implementation  of  Stratified  Volume  Holographic  Optical 
Elements  (SVHOEs);  and  (3)  the  development  and  characterization  of  photorefractive 
materials  and  devices  for  interconnection  applications.  During  the  research  program,  we 
have  invented  a  novel  approach  for  utilizing  volume  holographic  optical  elements  in 
conjunction  with  arrays  of  individually  coherent  but  mutually  incoherent  sources  and 
two-dimensional  spatial  light  modulators  to  provide  highly  multiplexed  and  weighted 
interconnections  characterized  by  high  throughput  efficiency,  low  interchannel  crosstalk, 
and  capability  for  simultaneous  (as  opposed  to  sequential)  initial  recording  and  weight 
updates.  We  have  also  identified  and  demonstrated  a  number  of  novel  features  of  SVHOE 
devices  that  have  applications  in  programmable  interconnections,  tunable  frequency 
filtering,  and  wavelength  multiplexing/demultiplexing.  In  addition  to  the  investigation  of 
the  fundamental  limitations  of  photorefractive  materials  as  used  in  interconnection 
applications  described  above,  we  have  also  attempted  to  improve  the  practical 
implementation  of  photorefractive  devices.  This  effort  has  included  the  development  of 
multilayer  antireflection  coatings  that  eliminate  the  source  of  multiple  internal  reflections, 
resulting  in  100%  increases  in  the  saturation  diffraction  efficiency  and  two  beam  coupling 
gain;  the  identification  and  characterization  of  the  dramatic  electric  field  collapse  within 
photorefractive  crystals  that  occurs  during  grating  recording,  by  means  of  a  novel 
transverse  electrooptic  imaging  method;  the  development  of  techniques  to  minimize  or 
eliminate  the  electric  field  collapse,  resulting  in  increases  in  diffraction  efficiency  and 
response  time;  the  development  of  a  novel  electrooptic  technique  for  the  measurement  of 
very  high  dark  resistivities  in  electrooptic  crystals  (which  determine  the  grating  storage 
time);  the  analysis  of  the  polarization  properties  of  diffraction  in  optically  active  and 
electrooptic  photorefractive  materials,  including  the  effects  of  self-diffraction  during  grating 
recording;  and  the  observation  and  characterization  of  the  effects  of  microscopic  charge 
screening  on  sub-hologram  formation  in  photorefractive  materials. 

2.  PROGRESS  DURING  THE  CONTRACT  PERIOD 

As  discussed  in  the  Program  Summary,  the  central  theme  of  this  research  program  is 
a  critical  assessment  of  the  prospects  for  the  implementation  of  optical  information 
processing  and  computing  systems,  based  on  a  parallel  assessment  of  advanced  photonic 
materials  and  device  concepts.  As  such,  the  principal  program  elements  have  included 
studies  of  the  fundamental  physical  and  technological  constraints  that  impact  current  and 
projected  computational  performance;  invention,  development,  and  characterization  of 
critical  optical  processing  and  computing  components,  such  as  one-  and  two-dimensional 
spatial  light  modulators  and  volume  holographic  optical  elements;  and  the  development  of 
advanced  techniques  for  materials  growth,  deposition,  and  processing  that  have  a  critical 
impact  on  potential  device  performance.  In  this  section,  we  describe  the  most  significant 
results  of  the  research  program  to  date,  provide  a  guide  to  the  various  publications  that 
document  these  results,  and  indicate  continuing  directions  of  research  where  appropriate. 


Fundamental  Physical  and  Technological  Constraints  on  Optical 
Information  Processing  and  Computing 

In  order  to  examine  the  fundamental  physical  and  technological  limitations  to  optical 
information  processing  and  computing,  we  have  considered  any  computational  process  to 
comprise  three  separate  functions:  the  representation  of  information,  the  implementation  of 
computational  complexity,  and  the  detection  of  results.  This  separation  provides  a  key  for 
the  analysis  of  any  proposed  algorithm,  architecture,  and  implementation  scheme  from  the 
perspective  of  a  given  metric,  such  as  the  total  energy  required  to  complete  a  particular 
calculation.  In  calculating  the  total  energy  dissipation  inherent  in  a  computational  process, 
it  is  necessary  to  include  the  detection  and  storage  of  the  inputs,  the  implementation  of  the 
computation,  the  costs  of  communication  (interconnection)  within  the  processor,  and  the 
detection  and  communication  of  the  results.  The  energy  metric  is  of  considerable 
importance,  as  many  currently  envisioned  computational  systems  (both  optical  and 
electronic)  are  highly  power  consumptive,  and  are  therefore  limited  in  performance  by  the 
thermodynamics  of  cooling. 

Our  approach  has  been  to  examine  the  computational  process  for  a  number  of 
important  computational  functions  (e.g.  two-dimensional  Fourier  transformation,  image- 
based  correlations,  synthetic  aperture  radar  image  formation,  and  nonlinear  dynamical 
systems  such  as  neural  networks)  that  have  proven  difficult  to  implement  by  electronic 
means,  without  resort  to  large  scale  systems  that  have  size,  weight,  and  power 
characteristics  that  make  them  inappropriate  for  a  wide  range  of  applications.  We  have 
examined  both  the  fundamental  boundaries  implied  by  quantum  statistics  and 
thermodynamics,  as  well  as  the  technological  boundaries  implied  by  the  choice  of  particular 
components  within  a  given  implementation  (such  as  spatial  light  modulators  based  on  III-V 
compound  semiconductor  multiple  quantum  well  modulators  hybridized  with  silicon 
detection  and  control  circuitry,  for  example)  [Appendices  4,  7,  8,  9,  and  10;  Publs.  4,  16; 
Book  Chs.  2,  3;  Conf.  Pres.  3,  5,  7,  13,  15,  19,  33,  43,  46,  51,  52,  56]. 

One  of  the  most  striking  results  to  emerge  from  this  study  has  been  a  realization  of  the 
dramatic  difference  in  energy  cost  that  exists  between  the  digital  (binary)  and  analog 
representations  of  a  given  number  at  the  same  probability  of  error  (the  equivalent  of  a  bit 
error  rate)  due  to  quantum  statistical  fluctuations  alone.  The  digital  representation  cost 
scales  as  the  logarithm  of  the  overall  dynamic  range,  while  by  contrast  the  analog 
representation  cost  scales  quadratically  with  increasing  dynamic  range.  This  result  has 
direct  implications  for  computation,  in  the  sense  that  computational  algorithms  based  on 
analog  representations  can  be  more  energy  efficient  at  the  fundamental  limits  only  if  the 
computational  complexity  of  the  process  implied  by  the  analog-based  algorithm  is  sufficient 
to  make  up  for  the  increased  representation  cost.  The  implications  of  this  result  also  extend 
to  the  representation  of  information,  for  example,  on  optically  addressed  spatial  light 
modulators,  in  which  case  a  clear  tradeoff  exists  among  resolution  (number  of  independent 
pixels),  frame  rate,  dynamic  range,  and  sensitivity  for  a  given  probability  of  representation 
error.  In  many  cases,  the  stated  performance  characteristics  quoted  for  certain  spatial  light 
modulators  cannot  be  obtained  simultaneously  without  provision  for  an  unacceptably  high 
input  intensity. 
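
The scaling contrast described above can be made concrete with a small numerical sketch. The text states only the scaling laws (logarithmic for the binary representation, quadratic for the analog one); the unit prefactors and the function names below are assumptions introduced for illustration, so only the trend with dynamic range is meaningful, not the absolute values.

```python
import math


def digital_cost(dynamic_range: int) -> float:
    """Relative cost of a binary representation.

    Assumption: one cost unit per bit, so the cost is log2 of the
    dynamic range (the true prefactor depends on the error probability).
    """
    return math.log2(dynamic_range)


def analog_cost(dynamic_range: int) -> float:
    """Relative cost of an analog representation.

    Assumption: unit prefactor; the cost grows as the square of the
    dynamic range, as stated in the text.
    """
    return float(dynamic_range) ** 2


def cost_ratio(dynamic_range: int) -> float:
    """How many times more costly the analog representation is."""
    return analog_cost(dynamic_range) / digital_cost(dynamic_range)


for d in (16, 256, 4096):
    print(d, digital_cost(d), analog_cost(d), cost_ratio(d))
```

Under these assumed prefactors the ratio grows rapidly with dynamic range, which is why an analog algorithm must carry correspondingly more computational complexity per represented value before it becomes the more energy-efficient choice.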

Given  the  difference  in  representation  cost,  we  have  compared  a  number  of 
implementations  of  such  computational  processes  as  the  Fourier  transform  that  are  analog 
or  digital,  and  optical  or  electronic  in  nature,  at  both  the  fundamental  (quantum-limited) 
boundaries,  and  at  the  current  limits  of  available  device  technologies  at  the  device,  circuit, 
and  systems  integration  levels.  The  results  are  quite  surprising.  At  the  fundamental  limit, 
analog  (optical)  and  digital  (electronic)  approaches  can  exhibit  comparable  energy  costs.  At 
the  technological  level,  however,  electronic  devices,  circuits,  and  systems  operate  at 
progressively  larger  factors  away  from  the  fundamental  limits,  whereas  currently  available 
optical  components  are  capable  of  operating  much  closer  to  the  appropriate  fundamental 
constraints.  We  are  in  the  process  of  extending  this  line  of  inquiry  to  include  analog 
electronic  circuits  and  systems,  as  well  as  hybrid  photonic  components  comprising  both 
optical  and  electronic  elements  [Appendices  4  and  8;  Publ.  4;  Book  Ch.  3;  Conf.  Pres.  3, 
5,  7,  13,  15,  19,  33,  43,  51,  52,  56]. 

A  second  major  avenue  of  investigation  has  been  to  examine  the  fundamental  and 
technological  limitations  of  spatial  light  modulation,  in  particular  focusing  on  the 
advantages  and  disadvantages  inherent  in  phase  and/or  amplitude  modulation,  on  methods 
for  optimizing  the  utilization  of  available  oscillator  strength  (particularly  in  multiple 
quantum  well  modulators),  and  on  the  optimization  of  asymmetric  cavities  to  enhance 
device  sensitivity,  contrast  ratio,  and  throughput  [Appendices  4,  7,  8,  9,  and  10;  Publs.  4, 
16;  Book  Chs.  2,  3;  Conf.  Pres.  3,  5,  7,  13,  15,  19,  33,  43,  46,  51,  52,  56].  In 
particular,  we  have  examined  in  detail  a  hybrid  spatial  light  modulator  with  InGaAs/GaAs 
multiple  quantum  well  modulators  that  utilize  the  quantum  confined  Stark  effect  in  an 
asymmetric  Fabry-Perot  cavity  configuration,  in  conjunction  with  silicon  driver  chips  that 
contain  appropriate  detection  and  control  circuitry  [Appendices  8  and  10;  Publ.  16;  Book 
Ch.  3;  Conf.  Pres.  46,  51,  56]. 

Both  pure  phase  and  pure  amplitude  modulation  cases  have  been  examined  from  the 
perspective  of  minimizing  signal-dependent  amplitude  modulation  in  the  phase  modulator 
case,  and  of  minimizing  signal-dependent  phase  modulation  in  the  amplitude  modulator 
case.  The  results  of  the  study  indicate  that  reflection  amplitude  modulators  with  contrast 
ratios  in  excess  of  20:1  can  be  achieved  with  a  dynamic  range  of  about  50%  (implying  a  3 
dB  insertion  loss),  with  acceptable  signal-dependent  phase  modulation  and  optical 
bandwidth.  Examination  of  the  sensitivity  of  these  results  to  process-induced  variations 
(such  as  thickness  nonuniformities  characteristic  of  current  state-of-the-art  molecular  beam 
epitaxy  (MBE)  techniques)  revealed  that  such  spatial  light  modulator  designs  must  be 
detuned  from  optimized  (theoretically  achievable)  performance  parameters  in  order  to  obtain 
the  requisite  uniformity  across  a  two-dimensional  array.  This  study  is  continuing  in 
conjunction  with  a  device  fabrication  and  characterization  effort  in  collaboration  with  Prof. 
Anupam  Madhukar's  research  group  at  USC. 
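
As a check on the figures quoted above for the reflection amplitude modulator, the 50% dynamic range can be read as the fraction of incident power returned in the high-reflectance state. This reading, and the symbols $R_{\mathrm{on}}$ and $R_{\mathrm{off}}$, are assumptions introduced here for illustration. Under that reading the insertion loss and the implied off-state reflectance are:

```latex
\mathrm{IL} = -10\log_{10} R_{\mathrm{on}} = -10\log_{10}(0.5) \approx 3.0\ \mathrm{dB},
\qquad
\frac{R_{\mathrm{on}}}{R_{\mathrm{off}}} > 20
\;\Longrightarrow\;
R_{\mathrm{off}} < \frac{0.5}{20} = 2.5\%.
```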

During  the  contract  period  we  have  also  undertaken  and  completed  a  study  of  the 
fundamental  physical  limitations  of  the  photorefractive  grating  recording  sensitivity,  in 
order  to  evaluate  the  prospects  for  application  of  photorefractive  materials  to  multiplexed 
interconnection  applications  [Appendices  4  and  7;  Publ.  4;  Book  Chs.  2,  3;  Conf.  Pres.  3, 
5,  7,  13,  15,  33].  In  this  study,  we  determined  the  quantum  limited  photorefractive 
sensitivity  that  can  be  achieved  assuming  a  single  photoexcited  carrier  for  each  absorbed 
photon,  and  compared  the  resulting  optimum  charge  distribution  with  those  that  arise  from 
sinusoidal  grating  exposure  profiles  in  conjunction  with  realistic  charge  transport  models. 
These  results  clearly  explain  the  fundamental  origins  of  the  observed  relatively  low 
quantum  efficiencies  of  photorefractive  recording  processes  in  materials  such  as  bismuth 
silicon  oxide  and  barium  titanate,  and  point  out  several  potential  directions  for  enhancement 
of  the  sensitivity.  The  importance  of  the  grating  recording  sensitivity  in  optical  processing 
and  computing  systems  applications  is  that  it  establishes  the  maximum  reconfiguration  rate 
of  a  volume  holographic  optical  element  or  interconnection  device  that  can  be  achieved  with 
a  given  (average)  optical  power. 

Electrically  and  Optically  Addressed  Spatial  Light  Modulators 

During  the  course  of  the  research  program,  a  number  of  different  types  of  electrically 
and/or  optically  addressed  spatial  light  modulators  have  been  invented,  developed,  and 
characterized.  These  devices  include  an  optical  disc  spatial  light  modulator,  a  multiple 
quantum  well  (MQW)  asymmetric  Fabry-Perot  spatial  light  modulator,  and  the 
Photorefractive  Incoherent-to-Coherent  Optical  Converter  (PICOC).  The  optical  disc 
spatial  light  modulator  is  electrically  addressed  (though  optically  written),  while  the  latter 
two  devices  are  optically  addressed.  These  three  devices  span  a  considerable  range  of 
spatial  light  modulator  functions  and  performance  characteristics,  as  well  as  potential  for 
inexpensive  manufacturability  and  near-term  insertion. 

Optical  disc  spatial  light  modulators  were  first  proposed  by  us  [Conf.  Publs.  15  and 
24]  in  order  to  take  advantage  of  the  significant  materials  and  device  development  leverage 
provided  by  commercially  available  CD-ROM  technology.  In  this  spatial  light  modulator 
approach,  the  format  of  an  optical  disc  is  altered  to  allow  for  the  recording  of  a  two- 
dimensional  image  in,  for  example,  a  1000  x  1000  pixel  array  within  a  1  cm²  area.  Grey 
scale  is  provided  by  area  encoding  techniques,  such  that  within  each  "pixel"  containing  100 
binary  bits,  any  number  of  bits  between  0  and  100  can  be  written,  providing  for  a 
binary-encoded  analog  representation.  On  readout,  each  pixel  is  under-resolved  to  allow  the  grey 
scale  to  be  realized  without  imaging  individual  bits. 
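
The area encoding scheme described above can be illustrated with a minimal sketch. The pixel size (100 binary bits) and the 1000 x 1000 array come from the text; the function names and the choice to pack the written bits at the start of each pixel are assumptions made only for illustration, since the actual placement of written bits within a pixel is not specified here.

```python
from typing import List

BITS_PER_PIXEL = 100  # from the text: each "pixel" contains 100 binary bits


def encode_pixel(level: int, bits_per_pixel: int = BITS_PER_PIXEL) -> List[int]:
    """Return a bit pattern with `level` written bits in one pixel."""
    if not 0 <= level <= bits_per_pixel:
        raise ValueError("grey level out of range")
    # Assumption: written bits are packed first; the real layout of
    # written bits within the pixel area is not given in the report.
    return [1] * level + [0] * (bits_per_pixel - level)


def read_pixel(bits: List[int]) -> float:
    """Under-resolved readout: only the written-area fraction is seen."""
    return sum(bits) / len(bits)


grey_levels = BITS_PER_PIXEL + 1               # 0 through 100 written bits
bits_per_frame = 1000 * 1000 * BITS_PER_PIXEL  # 1000 x 1000 pixel example

print(grey_levels, bits_per_frame, read_pixel(encode_pixel(37)))
```

Counting the unwritten state, 100 bits per pixel give 101 distinguishable grey levels, and the 1000 x 1000 example corresponds to 10^8 binary bits per frame.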

In  order  to  accomplish  such  readout  with  a  contrast  ratio  compatible  with  the  desired 
grey  scale,  we  developed  a  novel  differential  interferometric  readout  configuration  to 
circumvent  the  inherently  low  contrast  ratios  (about  2:1)  characteristic  of  commercially 
available  CD-ROMs  that  are  optimized  for  digital  data  recording  applications.  In  this 
configuration,  a  shear  plate  is  used  to  create  two  orthogonally  polarized  readout  beams, 
displaced  from  each  other  along  the  track  direction  by  one  half  of  the  inter-bit  spacing.  On 
readout,  two  independent  images  of  the  optical  disc  are  created,  which  recombine  within 
the  shear  plate  (cancelling  the  displacement)  and  interfere  with  each  other  when  passed 
through  an  appropriately  oriented  polarization  analyzer.  A  π  phase  shift  is  introduced 
between  the  beams  by  the  shear  plate,  such  that  the  beams  exactly  cancel  everywhere  except 
where  bits  have  been  written.  In  this  manner,  the  background  reflections  from  the  tracks 
and  grooves  are  eliminated,  and  written  bits  appear  as  a  pair  of  bright  dots  against  an 
essentially  black  background.  In  experimental  tests  of  the  technique  using  a  mirror  in  place 
of  the  optical  disc,  we  have  achieved  contrast  ratios  in  excess  of  600:1  as  limited  by  the  AR 
coatings  of  the  various  optical  elements.  On  double  sided  ablative  media  without  AR 
coatings,  contrast  ratios  of  5:1  have  been  measured,  whereas  on  single  sided  bump  forming 
media  the  experimentally  achieved  contrast  ratios  exceed  20:1  [Appendix  11;  Publs.  11,  12; 
Conf.  Pres.  15,  24,  33,  36,  40,  47,  52,  55]. 
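
The cancellation mechanism of the differential interferometric readout can be sketched with a one-dimensional toy model. This is a simplified illustration under stated assumptions, not a model of the actual optical system: the track is represented by real reflection amplitudes sampled along the track direction, the two sheared beams are taken to have equal strength, the π phase shift appears as a subtraction, and the track is wrapped periodically at its ends. The function name and all numerical values are hypothetical.

```python
from typing import List


def differential_readout(track: List[float], shear: int) -> List[float]:
    """Intensity after recombining two sheared beams with a pi phase shift.

    `track` holds assumed real reflection amplitudes along the track;
    `shear` is the beam displacement in samples (half the inter-bit
    spacing in the configuration described in the text).
    """
    n = len(track)
    intensity = []
    for i in range(n):
        a = track[i]                # beam 1 reads position i
        b = track[(i + shear) % n]  # beam 2 reads the displaced position
        # The pi phase shift turns the sum of the two beams into a difference.
        intensity.append((0.5 * (a - b)) ** 2)
    return intensity


# Hypothetical values: uniform background amplitude 0.8, one written bit 0.4.
track = [0.8] * 16
track[8] = 0.4
out = differential_readout(track, shear=2)
bright = [i for i, v in enumerate(out) if v > 1e-12]
print(bright)
```

In this toy model the uniform background cancels exactly, and the single written bit shows up at two sample positions separated by the shear, consistent with the pair of bright dots on a dark background described above.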

In  order  to  write  two-dimensional  images  directly  onto  an  optical  disc  in  a  format  that 
allows  for  parallel  readout,  the  traditional  sequential  bit  recording  process  must  be 
modified.  In  collaboration  with  Apex  Systems  in  Boulder,  Colorado,  we  have  developed 
an  optical  media  tester  that  is  capable  of  very  high  (sub-micron)  track-to-track  accuracy, 
and  of  recording  on  a  wide  range  of  optical  disc  media  including  ablative,  bump-forming, 
and  magneto-optic.  In  addition,  the  design  is  optimized  to  minimize  the  time  required  to 
record  each  two-dimensional  image.  Using  this  optical  media  tester,  which  was  recently 
acquired  during  the  contract  period,  we  have  successfully  written  a  wide  range  of  both 
binary  and  analog  two-dimensional  images  on  various  optical  disc  media.  The  largest 
images  written  to  date  have  512  x  512  pixels  with  101  binary-encoded  grey  scale  levels. 
Parallel  readout  of  these  images  using  the  differential  interferometric  readout  configuration 
has  been  accomplished  with  striking  results.  Further  investigations  are  under  way  to 
optimize  the  recording  parameters  utilized,  as  well  as  the  readout  configuration. 

One  of  the  most  useful  features  of  the  optical  disc  spatial  light  modulator  is  its 
capability  for  operation  in  a  "scrolling"  readout  mode,  as  is  desirable  for  example  in 
synthetic  aperture  radar  image  formation.  In  the  scrolling  mode,  it  is  necessary  to  read  out 
the  entire  image  for  subsequent  processing  following  the  arrival  of  each  additional  line  of 
data,  with  the  consequent  deletion  from  the  field  of  view  of  the  oldest  line  of  data. 
Operating  in  this  readout  mode,  the  optical  disc  is  currently  capable  of  parallel  readout  rates 
exceeding  5  terabits  of  information  per  second. 

A  second  type  of  spatial  light  modulator  that  we  have  actively  investigated  during  the 
program  period  is  a  hybrid  SLM  in  which  the  active  modulator  elements  are  based  on 
InGaAs/GaAs  multiple  quantum  wells  in  an  asymmetric  Fabry-Perot  cavity  configuration, 
and  the  optical  address  function  is  carried  out  by  detection  and  control  circuitry  integrated 
on  a  silicon  chip  by  standard  VLSI  foundry  processing.  We  have  chosen  the  strained  layer 
InGaAs/GaAs  system  for  epitaxial  growth  on  GaAs  substrates  in  order  to  achieve  substrate 
transparency  at  the  operational  wavelength  of  about  950  nm.  This  substrate  transparency 
can  be  used  to  advantage  by  allowing  for  an  inverted  asymmetric  Fabry-Perot  cavity 
geometry  in  which  the  low  reflectivity  Bragg  mirror  is  grown  by  MBE  techniques  (in  Prof. 
Anupam  Madhukar's  laboratory)  below  the  MQW  structure,  and  the  high  reflectivity  mirror 
is  an  externally  deposited  dielectric  multilayer  coating  [Appendix  10;  Publ.  16;  Book  Ch.  3; 
Conf.  Publs.  46,  48,  49,  50,  51,  52,  54,  56].  This  configuration  relieves  the  MBE  growth 
process  of  including  the  (several  micron  thick)  high  reflectivity  Bragg  mirror  below  the 
MQW  structure,  at  once  minimizing  the  growth  time  and  complexity  as  well  as  the 
accumulation  of  strain  and  lattice  defects.  In  addition,  it  allows  for  face-to-face  contact  by 
means  of  flip-chip  bonding  techniques  between  the  GaAs  substrate  containing  the 
modulator  elements,  and  the  Si  chip  containing  the  detectors  and  control  electronics.  This 
in  turn  eliminates  the  need  for  vias  through  both  substrates,  as  is  characteristic  of  traditional 
hybridization. 

To  date  we  have  performed  a  thorough  analysis  of  the  potential  performance 
characteristics  that  can  be  expected  from  such  a  hybrid  SLM  based  on  InGaAs/GaAs 
MQWs  (as  well  as  on  AlGaAs/GaAs  MQWs)  in  order  to  determine  both  optimized  mirror 
and  quantum  well  designs,  as  well  as  the  sensitivity  of  such  designs  to  anticipated  process- 
induced  variations.  On  the  basis  of  this  analysis,  we  have  grown  a  number  of  candidate 
modulator  structures,  for  which  extensive  optical  and  electrical  characterization  is  in 
progress  at  the  present  time.  In  every  case  examined  thus  far,  the  predictions  based  on  our 
design  analysis  have  been  borne  out.  In  parallel  with  this  effort,  we  have  also  designed 
and  fabricated  (through  the  Metal-Oxide-Semiconductor  Implementation  Service  (MOSIS) 
operated  for  DARPA  by  USC's  Information  Sciences  Institute)  a  number  of  silicon  chips 
that  contain  various  types  of  photodetectors  and  control  circuitry.  These  chips  are  fully 
functional, and are in the process of being characterized to evaluate their maximum operational
bandwidth,  the  accuracy  of  the  control  circuitry  functional  transformation,  and  the  optical 
sensitivity  of  the  photodetectors.  Preliminary  studies  of  the  requisite  flip-chip  bonding 
requirements  are  under  way. 

The  final  type  of  spatial  light  modulator  that  has  been  developed  and  characterized 
during  this  research  program  is  the  Photorefractive  Incoherent-to-Coherent  Optical 
Converter  (PICOC),  which  has  the  unusual  feature  of  combining  photorefractive  volume 
holographic grating recording techniques with incoherent-to-coherent conversion capability
within the same device [Appendix 6; Book Ch. 1]. This feature at once allows for simple
and  inexpensive  device  fabrication,  as  any  of  a  number  of  bulk  photorefractive  crystals  can 
be  used  to  configure  this  device  without  the  need  for  additional  device  processing.  In 
addition,  it  allows  for  a  number  of  unique  optical  information  processing  and  computing 
applications  that  involve  both  spatial  light  modulation  and  volume  holographic  recording 
techniques.  During  the  contract  period,  an  intensive  study  of  the  factors  that  affect  PICOC 
operational  mode,  readout  configuration,  sensitivity,  resolution,  and  contrast  ratio  was 
completed.  The  results  of  this  study  are  described  in  detail  in  Appendix  6. 

Volume  Holographic  Optical  Elements 

One  of  the  most  important  potential  advantages  of  optical  techniques  for  application  to 
information  processing  and  computing  systems  is  the  capacity  for  parallel,  densely  packed 
crosstalk-free  interconnections  among  multiple  processing  planes  provided  by  volume 
holographic  recording  and  reconstruction  techniques.  Should  real  time  implementations  of 
such  optical  interconnections  not  prove  viable,  this  advantage  may  prove  to  be  minimal,  if  it 
exists  at  all.  As  such,  a  critical  feature  of  this  research  program  has  been  the  evaluation  of 
the  prospects  for  high  performance  volume  holographic  interconnections,  as  well  as  the 
development  of  practical  photorefractive  materials  and  devices. 

During  the  contract  period,  significant  progress  has  been  achieved  in  the  development 
of  a  novel  approach  to  highly  multiplexed  weighted  interconnections  based  on  holographic 
techniques,  on  the  characterization  and  implementation  of  Stratified  Volume  Holographic 
Optical  Elements  (SVHOEs),  and  on  the  development  and  characterization  of 
photorefractive  materials  and  devices  for  interconnection  applications.  Progress  in  each  of 
these  three  areas  is  summarized  below. 

We  have  recently  discovered  that  the  traditional  methods  for  recording  volume 
holographic interconnections lead to either high interchannel crosstalk in the case of fully
coherent, simultaneous recording schemes, or to interchannel crosstalk, high throughput
losses,  and  complex  recording  schedules  in  the  case  of  sequential  (coherent  or  incoherent) 
recording  schemes.  By  utilizing  the  optical  beam  propagation  method,  we  have  simulated  a 
number  of  such  point-to-point  weighted  interconnections  with  multiple  holographically 
recorded  gratings,  and  have  been  able  to  quantify  the  degree  of  crosstalk  and  throughput 
loss in each case [Appendix 8; Publs. 9 and 10; Book Ch. 3; Patent 1; Conf. Pres. 37, 38,
39,  41,  42,  45,  49,  50,  52,  53,  54,  57].  In  addition,  we  have  identified  a  new  form  of 
crosstalk,  so-called  beam  degeneracy  crosstalk,  that  produces  substantial  interchannel 
crosstalk  even  in  the  incoherent  recording  case,  and  accounts  for  a  dramatic  throughput  loss 
as  well.  For  an  N  input  point  to  N  output  point  holographic  interconnection  recorded  by 
traditional sequential methods, for example, the optical throughput efficiency is reduced by a
factor of order 1/N. This loss is potentially catastrophic for large scale interconnection networks
such  as  those  envisioned  for  optical  implementations  of  neural  networks. 
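
To give a sense of scale for the 1/N throughput scaling quoted above, the following minimal sketch tabulates the order-of-magnitude efficiency and the equivalent loss in decibels for a few array sizes. The function names and the unit prefactor are illustrative assumptions introduced here, not quantities taken from the analysis in Appendix 8.

```python
import math


def sequential_throughput(n_channels: int) -> float:
    """Order-of-magnitude throughput efficiency (~1/N) for an N-to-N
    interconnection recorded sequentially. The prefactor is set to 1
    purely for illustration; it is an assumption, not a reported value."""
    if n_channels < 1:
        raise ValueError("n_channels must be a positive integer")
    return 1.0 / n_channels


def loss_in_db(efficiency: float) -> float:
    """Express a fractional throughput efficiency as a loss in decibels."""
    return -10.0 * math.log10(efficiency)


if __name__ == "__main__":
    # Array sizes chosen only as examples of small and large networks.
    for n in (8, 64, 1024):
        eta = sequential_throughput(n)
        print(f"N = {n:5d}: efficiency ~ {eta:.2e}, loss ~ {loss_in_db(eta):.1f} dB")
```

Under this assumed unit prefactor, each tenfold increase in N costs a further 10 dB of optical throughput, which is why the scaling becomes prohibitive for networks with many interconnected points.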

Subsequent  to  this  analysis,  we  have  invented  an  architecture  based  on  two- 
dimensional  parallel  source  arrays,  spatial  light  modulators,  and  volume  holographic  optical 
elements  that  is  capable  of  circumventing  the  crosstalk  and  throughput  problems,  as  well  as 
providing for simultaneous rather than sequential weight updates [Appendix 8; Publs. 9
and  10;  Book  Ch.  3;  Patent  1;  Conf.  Pres.  37,  38,  39,  41,  42,  45,  49,  50,  52,  53,  54,  57]. 
In  this  approach,  a  two-dimensional  source  array  (e.g.  the  surface  emitting  semiconductor 
diode laser arrays recently announced by Bellcore and AT&T) is employed, in which each
source  is  individually  coherent,  but  all  sources  are  mutually  incoherent  over  the  response 
time  of  the  holographic  medium.  Two  optically  addressed  spatial  light  modulators  are 
employed  to  represent  the  input  and  output  planes.  Parallel  illumination  of  the  spatial  light 
modulators  by  the  source  array  produces  (with  appropriate  optics)  sets  of  mutually 
incoherent  holograms  in  the  volume  holographic  recording  medium,  all  of  which  are 
doubly  angularly  multiplexed  to  avoid  the  effects  of  beam  degeneracy. 


Simulations  of  this  architecture  using  the  optical  beam  propagation  method  have 
shown  a  remarkable  reduction  in  interchannel  crosstalk,  accompanied  by  a  corresponding 
substantial  increase  in  optical  throughput  efficiency.  Laboratory  demonstrations  of  8  x  8 
and  2  x  64  interconnections  in  real  time  photorefractive  crystals  have  confirmed  the 
theoretical  and  numerical  predictions.  Extension  of  both  the  numerical  simulations  and  the 
laboratory  demonstrations  to  larger  array  sizes  is  under  way  at  the  present  time. 

The  Stratified  Volume  Holographic  Optical  Element  or  SVHOE  is  a  unique  optical 
element  capable  of  emulating  volume  holographic  diffraction  characteristics  in  devices 
consisting  of  only  thin  layered  photosensitive  materials,  and  at  the  same  time  exhibits  a 
large  number  of  novel  properties  that  are  useful  in  array  interconnection,  wavelength 
multiplexing  and  demultiplexing,  and  spatial  frequency  filtering  applications  [Appendices  3 
and 5; Publs. 3, 5, 6, 7; Conf. Pres. 1, 2, 4, 6, 9, 11, 12, 14, 15, 16, 17, 21, 29, 32, 33,
49, 50, 52].

An  SVHOE  consists  of  a  number  of  thin  photosensitive  layers  separated  by  substrate 
(or  buffer)  layers  that  are  optically  insensitive.  This  type  of  construction  is  particularly 
advantageous  for  materials  that  exhibit  large  photoinduced  refractive  index  variations,  but 
that  are  difficult  to  fabricate  in  the  thicknesses  required  for  highly  multiplexed  volume 
holographic  device  operation  (of  order  1  mm  to  1  cm).  Examples  of  such  materials  include 
the  commercially  available  DuPont  Holopolymer  material,  nonlinear  organic  materials,  and 
III-V compound semiconductor multiple quantum well structures. In operation, the device
is  exposed  to  two  coherent  recording  beams  that  interfere  in  the  photosensitive  layers  to 
record  the  hologram,  and  readout  is  performed  in  direct  analogy  with  other  volume 
holographic  optical  elements. 

A number of novel diffraction characteristics of SVHOE structures have been
identified, numerically modeled, and in most cases experimentally demonstrated during the
contract period. These include a unique periodic angular tuning characteristic, the
emulation of Bragg limited angular response characteristics, the generation of regularly
spaced arrays of output angles (or positions) when illuminated by a strongly focused beam,
spatial frequency notch filtering, wavelength notch filtering, and wavelength multiplexing
and demultiplexing. In addition, SVHOE structures fabricated with active photorefractive
materials, such as III-V compound semiconductor multiple quantum well structures that are
voltage or field enabled, are capable of exhibiting tunable diffraction characteristics that can
be externally modulated by altering the distribution of applied voltages on a layer by layer
basis. Preliminary studies of such active SVHOE structures in quantum well devices have
been completed, and fabrication and evaluation of test structures are under way. Finally,
demonstration of key SVHOE characteristics with the DuPont Holopolymer material is
under way in a cooperative effort with DuPont, in order to take advantage of the in situ
fixing capabilities of this class of organic materials.

In addition to the investigation of the fundamental limitations of photorefractive
materials (as used in interconnection applications) described above [Appendices 4 and 7;
Publ. 4; Book Chs. 2, 3; Conf. Pres. 3, 5, 7, 13, 15, 33], we have also undertaken a
multifaceted effort to improve the prospects for practical implementation of photorefractive
devices. This effort has included the development of multilayer antireflection coatings on
Bi12SiO20, LiNbO3, BaTiO3, SBN, GaAs, and CdTe that eliminate the source of multiple
internal reflections within the photorefractive material [Publs. 13, 14, 15; Conf. Pres. 25,
33, 34, 35, 49, 50, 52], resulting in 100% increases in the saturation diffraction efficiency
and two beam coupling gain; the identification and characterization of the electric field
collapse within photorefractive crystals that occurs during grating recording, by means of a
novel transverse electrooptic imaging method [Publ. 17; Conf. Pres. 27, 37, 33]; the
development of a novel electrooptic technique for the measurement of very high dark
resistivities in electrooptic crystals (which determine the grating storage time) [Appendix 1;
Publ. 1; Conf. Pres. 8, 18, 22]; the application of the electrooptic measurement technique
to the determination of the dark resistivities of undoped, doped, and nonstoichiometric
single crystals of bismuth silicon oxide [Appendix 1; Publ. 1; Conf. Pres. 8, 18, 22]; the
analysis of the polarization properties of diffraction in optically active and electrooptic
photorefractive materials, including the effects of self-diffraction during grating recording
[Appendix 2; Publ. 2; Conf. Pres. 26]; and the observation and characterization of the
effects of microscopic charge screening on sub-hologram formation in photorefractive
materials [Publ. 8; Conf. Pres. 30], which has important implications both for the
implementation of spatially segmented volume holograms in real time materials and
for the analysis of the effects of macroscopic space-variant illumination in the
recording of full aperture volume holograms.

3. PUBLICATIONS UNDER DARPA/AFOSR SPONSORSHIP

The following technical publications and conference presentations describe research
supported in part by this contract.

3.1  Journal  Publications 

1. D. A. Seery, M. H. Garrett, and A. R. Tanguay, Jr., "Electrooptic Measurement of
the Volume Resistivity of Bismuth Silicon Oxide (Bi12SiO20)", Journal of Crystal
Growth,  85,  282-289,  (1987). 

2.  A.  Marrakchi,  R.  V.  Johnson,  and  A.  R.  Tanguay,  Jr.,  "Polarization  Properties  of 
Enhanced  Self-Diffraction  in  Sillenite  Crystals",  IEEE  Journal  of  Quantum 
Electronics,  Special  Issue  on  Electrooptic  Materials  and  Devices,  QE-23(12),  2142- 
2151,  (1987). 

3.  R.  V.  Johnson  and  A.  R.  Tanguay,  Jr.,  "Stratified  Volume  Holographic  Optical 
Elements",  Optics  Letters,  13(3),  189-191,  (1988). 

4.  A.  R.  Tanguay,  Jr.,  "Physical  and  Technological  Limitations  of  Optical  Information 
Processing  and  Computing",  Materials  Research  Society  Bulletin,  Special  Issue  on 
Photonic  Materials,  XIII(8),  36-40,  (1988);  (Invited  Paper). 

5.  R.  V.  Johnson  and  A.  R.  Tanguay,  Jr.,  “Stratified  Volume  Holographic  Optical 
Elements”,  Optics  News,  Special  Issue  on  “Optics  in  1988”,  14(12),  30-31,  (1988); 
(Invited  Paper). 

6.  G.  P.  Nordin,  R.  V.  Johnson,  and  A.  R.  Tanguay,  Jr.,  "Physical  Characterization 
of  Stratified  Volume  Holographic  Optical  Elements",  in  preparation  for  Optics 
Letters. 

7.  G.  P.  Nordin,  R.  V.  Johnson,  and  A.  R.  Tanguay,  Jr.,  “Diffraction  Properties  of 
Stratified  Volume  Holographic  Optical  Elements”,  in  preparation  for  Journal  of  the 
Optical Society of America.

8. P. Asthana and A. R. Tanguay, Jr., "Charge-Screening-Induced Switching in
Spatially Multiplexed Sub-Holograms in Bi12SiO20", in preparation for Applied
Physics  Letters. 

9.  P.  Asthana,  G.  P.  Nordin,  A.  R.  Tanguay,  Jr.,  and  B.  K.  Jenkins,  “Beam 
Degeneracy  Crosstalk  in  Fan-Out/Fan-In  Volume  Holographic  Interconnections”,  in 
preparation  for  Optics  Letters. 

10. P. Asthana, G. P. Nordin, A. R. Tanguay, Jr., and B. K. Jenkins, “Analysis and
Minimization  of  Inter-channel  Crosstalk  in  Volume  Holographic  Interconnections  for 
Optical  Neural  Networks”,  in  preparation  for  Journal  of  the  Optical  Society  of 
America. 

11.  J.  H.  Rilum  and  A.  R.  Tanguay,  Jr.,  "Characterization  of  Optical  Memory  Discs  for 
Optical  Information  Processing  Applications",  in  preparation  for  Applied  Optics. 

12.  J.  H.  Rilum  and  A.  R.  Tanguay,  Jr.,  “Differential  Interferometric  Readout  Optical 
Memory  Disc  Spatial  Light  Modulators”,  in  preparation  for  Optics  Letters. 

13.  Z.  Karim  and  A.  R.  Tanguay,  Jr.,  "A  Bandpass  AR  Coating  Design  for  Bismuth 
Silicon  Oxide",  in  preparation  for  Applied  Physics  Letters. 

14.  Z.  Karim  and  A.  R.  Tanguay,  Jr.,  “A  Bandpass  AR  Coating  for  the  Photorefractive 
Materials LiNbO3, BaTiO3, CdTe, and PLZT”, in preparation for Applied Optics.

15.  Z.  Karim,  C.  Kyriakakis,  and  A.  R.  Tanguay,  Jr.,  “Improved  Two-Beam  Coupling 
Gain  and  Diffraction  Efficiency  in  Bismuth  Silicon  Oxide  Crystals  Using  a  Bandpass 
AR  Coating”,  in  preparation  for  Applied  Physics  Letters. 

16.  C.  Kyriakakis,  Z.  Karim,  J.  H.  Rilum,  J.  J.  Jung,  A.  R.  Tanguay,  Jr.,  and  A. 
Madhukar,  “Fundamental  and  Technological  Limitations  of  Asymmetric  Cavity 
MQW  InGaAs/GaAs  Spatial  Light  Modulators”,  in  preparation  for  IEEE  Journal  of 
Quantum  Electronics. 

17.  E.  J.  Herbulock  and  A.  R.  Tanguay,  Jr.,  "Electric  Field  Profile  Effects  on 
Photorefractive  Grating  Formation  in  Bismuth  Silicon  Oxide",  in  preparation  for 
Applied Physics Letters.

3.2 Book Chapters

1. J. Yu, D. Psaltis, A. Marrakchi, A. R. Tanguay, Jr., and R. V. Johnson,
"Photorefractive Incoherent-to-Coherent Optical Conversion," in Photorefractive
Materials  and  Applications,  J.  P.  Huignard  and  P.  Gunter,  Eds.,  Springer-Verlag, 
New  York,  (1989). 

2.  R.  V.  Johnson  and  A.  R.  Tanguay,  Jr.,  "Fundamental  Physical  Limitations  of  the 
Photorefractive  Grating  Recording  Sensitivity",  in  Optical  Processing  and 
Computing,  H.  Arsenault,  T.  Szoplik,  and  B.  Macukow,  Eds.,  Academic  Press, 
New  York,  (1989). 

3.  B.  K.  Jenkins  and  A.  R.  Tanguay,  Jr.,  “Photonic  Implementations  of  Neural 
Networks”,  Chapter  15  in  Neural  Networks  and  Fuzzy  Systems:  A  Dynamical 
Approach  to  Machine  Intelligence,  B.  Kosko,  Ed.,  Prentice  Hall,  Englewood  Cliffs, 
New  Jersey,  (1991). 

3.3  Patents 

1.  “Incoherent/Coherent  Multiplexed  Holographic  Recording  for  Photonic 
Interconnections  and  Holographic  Optical  Elements”,  B.  K.  Jenkins  and  A.  R. 
Tanguay,  Jr.,  patent  pending. 

3.4  Conference  Presentations 

1.  A.  R.  Tanguay,  Jr.,  "Optical  Information  Processing  Components",  Georgia 
Institute  of  Technology,  Atlanta,  Georgia,  (1987);  (Invited  Colloquium). 

2. A. R. Tanguay, Jr., and R. V. Johnson, "Stratified Volume Holographic Optical
Elements", Conference on Lasers and Electro-Optics, Baltimore, Maryland, (1987).

3.  A.  R.  Tanguay,  Jr.,  "Fundamental  Physical  Limitations  of  Optical  Information 
Processing  and  Computing",  NATO  Collaborative  Award  Colloquium,  University 
College  London,  London,  England,  (1987);  (Invited  Colloquium). 

4.  A.  R.  Tanguay,  Jr.,  "Optical  Information  Processing  Components",  GEC  Research, 
Marconi Research Centre, Chelmsford, England, (1987); (Invited Colloquium).

5. A. R. Tanguay, Jr., "Fundamental Physical Limitations of Optical Information
Processing  and  Computing",  British  Telecom  Research  Laboratories,  Ipswich, 
England,  (1987);  (Invited  Colloquium). 

6.  A.  R.  Tanguay,  Jr.,  "Optical  Information  Processing  Components",  University  of 
California  at  Santa  Barbara,  Santa  Barbara,  California,  (1987);  (Invited 
Colloquium). 

7.  A.  R.  Tanguay,  Jr.,  "Fundamental  Physical  Limitations  of  Optical  Information 
Processing  and  Computing",  University  of  Rochester,  Institute  of  Optics, 
Rochester,  New  York,  (1987);  (Invited  Colloquium). 

8.  M.  H.  Garrett,  D.  A.  Seery,  and  A.  R.  Tanguay,  Jr.,  "Electrooptic  Measurement  of 
the Volume Resistivity of Bismuth Silicon Oxide (Bi12SiO20)", American
Conference  on  Crystal  Growth-7,  Monterey,  California,  (1987);  (Invited  Paper). 

9.  A.  R.  Tanguay,  Jr.,  "Optical  Processing  and  Computing  Devices:  The  Materials 
Perspective",  American  Conference  on  Crystal  Growth-7,  Monterey,  California, 
(1987);  (Invited  Paper). 

10.  A.  R.  Tanguay,  Jr.,  "The  Professor  Profession",  1987  Annual  Meeting  of  the 
Optical  Society  of  America,  Rochester,  New  York,  (1987);  (Invited  Presentation). 

11.  A.  R.  Tanguay,  Jr.,  "Optical  Information  Processing  Components",  Bell 
Communications Research, Morristown, New Jersey, (1987); (Invited
Colloquium). 

12.  A.  R.  Tanguay,  Jr.,  "Optical  Information  Processing  Components",  IBM  Almaden 
Research  Center,  San  Jose,  California,  (1987);  (Invited  Colloquium). 

13.  A.  R.  Tanguay,  Jr.,  "Fundamental  Physical  Limitations  of  Optical  Information 
Processing  and  Computing",  DARPA  Panel  on  Neural  Networks,  California 
Institute  of  Technology,  Pasadena,  California,  (1987);  (Invited  Presentation). 

14.  A.  R.  Tanguay,  Jr.,  and  R.  V.  Johnson,  "Stratified  Volume  Holographic  Optical 
Elements",  IEEE-LEOS  Southern  California  Section  Winter  Regional  Meeting, 
University of Southern California, Los Angeles, California, (1988).

15. A. R. Tanguay, Jr., "Photonic Materials and Devices for Optical Information
Processing  and  Computing  Applications",  DARPA  Annual  Conference  on  Optical 
Processing  and  Computing,  Leesburg,  Virginia,  (1988);  (Invited  Paper). 

16.  A.  R.  Tanguay,  Jr.,  "Electro-Optical  Information  Processing  and  Computing 
Components",  Conference  on  Lasers  and  Electro-Optics,  Anaheim,  California, 
(1988);  (Invited  Paper). 

17.  A.  R.  Tanguay,  Jr.,  and  R.  V.  Johnson,  "Spatial  Light  Modulators  and  Volume 
Holographic  Optical  Elements",  Symposium  of  the  Center  for  the  Integration  of 
Optical  Computing,  University  of  Southern  California,  Los  Angeles,  California, 
(1988);  (Invited  Paper). 

18.  M.  H.  Garrett  and  A.  R.  Tanguay,  Jr.,  "Crystal  Growth  and  Characterization  of 
Nonstoichiometric Bismuth Silicon Oxide (BixSiO1.5x+2)", AACG/West Tenth
Conference  on  Crystal  Growth,  Fallen  Leaf  Lake,  California,  (1988). 

19.  C.  Kyriakakis,  P.  Asthana,  R.  V.  Johnson,  and  A.  R.  Tanguay,  Jr.,  "Spatial  Light 
Modulators:  Fundamental  and  Technological  Issues",  Optical  Society  of  America 
Topical  Meeting  on  Spatial  Light  Modulators,  Lake  Tahoe,  Nevada,  (1988);  (Invited 
Paper). 

20. S. Mroczkowski and A. R. Tanguay, Jr., "Impurity Induced Photochromic
Behavior in Bismuth Silicon Oxide (Bi12SiO20)", American Conference on Crystal
Growth/East-2,  Atlantic  City,  New  Jersey,  (1988). 

21.  G.  P.  Nordin,  R.  V.  Johnson,  and  A.  R.  Tanguay,  Jr.,  "Physical  Characterization 
of  Stratified  Volume  Holographic  Optical  Elements",  1988  Annual  Meeting  of  the 
Optical  Society  of  America,  Santa  Clara,  California,  Vol.  11  of  the  1988  OSA 
Technical  Digest  Series,  p.  106,  (1988). 

22. M. H. Garrett, S. W. McCahon, and A. R. Tanguay, Jr., "Crystal Growth and
Characterization of Nonstoichiometric Bismuth Silicon Oxide, BixSiO1.5x+2", 1988
Annual Meeting of the Optical Society of America, Santa Clara, California, Vol. 11
of the 1988 OSA Technical Digest Series, p. 106, (1988).

23. P. Asthana and A. R. Tanguay, Jr., "Charge-Screening-Induced Switching in
Spatially Multiplexed Sub-Holograms in Bi12SiO20", 1988 Annual Meeting of the
Optical Society of America, Santa Clara, California, Vol. 11 of the 1988 OSA
Technical  Digest  Series,  p.  151,  (1988). 

24.  J.  H.  Rilum  and  A.  R.  Tanguay,  Jr.,  "Utilization  of  Optical  Memory  Discs  for 
Optical  Information  Processing  Applications",  1988  Annual  Meeting  of  the  Optical 
Society  of  America,  Santa  Clara,  California,  Vol.  11  of  the  1988  OSA  Technical 
Digest  Series,  p.  43,  (1988). 

25.  Z.  Karim,  M.  H.  Garrett,  and  A.  R.  Tanguay,  Jr.,  "A  Bandpass  AR  Coating  Design 
for  Bismuth  Silicon  Oxide",  1988  Annual  Meeting  of  the  Optical  Society  of  America, 
Santa  Clara,  California,  Vol.  11  of  the  1988  OSA  Technical  Digest  Series,  p.  125, 
(1988). 

26.  A.  Marrakchi,  R.  V.  Johnson,  and  A.  R.  Tanguay,  Jr.,  "Polarization  Properties  of 
Enhanced  Self-Diffraction  in  Sillenite  Crystals",  1988  Annual  Meeting  of  the  Optical 
Society of America, Santa Clara, California, Vol. 11 of the 1988 OSA Technical
Digest  Series,  p.  107,  (1988). 

27.  E.  J.  Herbulock,  M.  H.  Garrett,  and  A.  R.  Tanguay,  Jr.,  "Electric  Field  Profile 
Effects  on  Photorefractive  Grating  Formation  in  Bismuth  Silicon  Oxide",  1988 
Annual Meeting of the Optical Society of America, Santa Clara, California, Vol. 11
of  the  1988  OSA  Technical  Digest  Series,  p.  143,  (1988). 

28.  S.  Mroczkowski  and  A.  R.  Tanguay,  Jr.,  "Crystal  Growth  and  Impurity  Induced 
Photochromic Behavior in Bismuth Silicon Oxide (Bi12SiO20)", The Eighth National
Conference  on  Crystal  Growth  and  Materials,  Guilin,  China,  (1988). 

29.  G.  P.  Nordin,  R.  V.  Johnson,  and  A.  R.  Tanguay,  Jr.,  "Physical  Characterization 
of  Stratified  Volume  Holographic  Optical  Elements",  Signal  and  Image  Processing 
Institute  Annual  Research  Review,  University  of  Southern  California,  Los  Angeles, 
California, (1989).

30. P. Asthana and A. R. Tanguay, Jr., "Charge-Screening-Induced Switching in
Spatially Multiplexed Sub-Holograms in Bi12SiO20", Signal and Image Processing
Institute  Annual  Research  Review,  University  of  Southern  California,  Los  Angeles, 
California,  (1989). 

31.  M.  Hibbs-Brenner,  S.  D.  Mukherjee,  M.  P.  Bendett,  and  A.  R.  Tanguay,  Jr., 
“Integrated  Optoelectronic  Cellular  Array  for  Fine-Grained  Parallel  Processing 
Systems”,  Proc.  OSA  Topical  Meeting  on  Optical  Computing,  Salt  Lake  City,  Utah, 
(1989). 

32.  A.  R.  Tanguay,  Jr.,  "Device  Development  for  Optical  Computing",  Conference  on 
Lasers  and  Electro-Optics  (CLEO  ’89),  Baltimore,  Maryland,  (1989);  (Invited 
Paper). 

33.  A.  R.  Tanguay,  Jr.,  “Photonic  Materials  and  Devices  for  Optical  Information 
Processing  and  Computing  Applications",  DARPA  Annual  Conference  on  Optical 
Processing  and  Computing,  Airlie,  Virginia,  (1989). 

34.  Z.  Karim,  C.  Kyriakakis,  and  A.  R.  Tanguay,  Jr.,  "Improved  Two-Beam  Coupling 
Gain  and  Diffraction  Efficiency  in  Bismuth  Silicon  Oxide  Crystals  Using  a  Bandpass 
AR  Coating",  1989  Annual  Meeting  of  the  Optical  Society  of  America,  Orlando, 
Florida,  Vol.  18  of  the  1989  OSA  Technical  Digest  Series,  p.  29,  (1989). 

35.  Z.  Karim  and  A.  R.  Tanguay,  Jr.,  "Bandpass  AR  Coating  for  the  Photorefractive 
Materials LiNbO3, BaTiO3, CdTe, and PLZT", 1989 Annual Meeting of the Optical
Society  of  America,  Orlando,  Florida,  Vol.  18  of  the  1989  OSA  Technical  Digest 
Series,  p.  78,  (1989). 

36.  J.  H.  Rilum  and  A.  R.  Tanguay,  Jr.,  "Performance  Characteristics  of  Optical 
Memory  Disc  Spatial  Light  Modulators",  1989  Annual  Meeting  of  the  Optical 
Society  of  America,  Orlando,  Florida,  Vol.  18  of  the  1989  OSA  Technical  Digest 
Series,  pp.  171-172,  (1989). 

37. B. K. Jenkins, G. C. Petrisor, S. Piazzolla, P. Asthana, and A. R. Tanguay, Jr.,

ELECTROOPTIC MEASUREMENT OF THE VOLUME RESISTIVITY

Single crystals of bismuth silicon oxide (Bi12SiO20) and its isomorphs (including, for example, bismuth germanium oxide
(Bi12GeO20)) have been utilized in a wide range of active electrooptic and acoustooptic devices, including the Pockels Readout Optical
Modulator (PROM), the PRIZ, the Photorefractive Incoherent-to-Coherent Optical Converter (PICOC), volume holographic storage
devices, and surface acoustic wave devices. A key material parameter that influences device performance characteristics is the volume
resistivity, which is difficult to measure accurately using standard techniques in refractory oxides like Bi12SiO20 due to its large
magnitude (typically > 10^[illegible] Ω cm). We present here a technique for the measurement of such very high resistivities in electrooptic
materials. This method utilizes the electrooptic modulation induced by a voltage placed across the (crystallographically oriented)
sample as a probe of temporal voltage transients that are in turn directly related to the sample volume resistivity. In our experiment,
a very weak optical probe is frequency modulated, phase detected, and employed at low duty cycle to avoid ambiguities due to
photoconductive voltage decay. The technique is described in detail, and experimental results are presented on a number of undoped
and doped samples of bismuth silicon oxide grown by the Czochralski technique.


1.  Introduction 

Bismuth silicon oxide is a wide bandgap, high resistivity semi-insulator that is also
photoconductive, electrooptic, acoustooptic, magnetooptic, and optically active. As such,
it has found widespread application in optical information processing and computing
components such as spatial light modulators and volume holographic optical elements
[1]. Examples of such devices include the Pockels Readout Optical Modulator (PROM)
[2,3], the Photorefractive Incoherent-to-Coherent Optical Converter (PICOC) [4], the
Optically Modulated Total Internal Reflection Spatial Light Modulator (OM-TIR SLM)
[5], and photorefractive volume holographic optical elements (VHOEs) [1,6].

Several important operational parameters of both electrooptic spatial light modulators
and volume holographic optical elements depend directly on the volume resistivity (ρ) of
the active electrooptic material. For example, the dielectric relaxation time (τ = ρ ε ε₀, in
which ε is the relative dielectric permittivity of the material and ε₀ is the permittivity of
free space) establishes the frame storage time in the PROM, PICOC, and OM-TIR spatial
light modulators, as well as the maximum frame integration time. In addition, the
dielectric relaxation time determines the holographic grating storage time in Bi12SiO20
volume holographic optical elements. Furthermore, spatial nonuniformities in the
resistivity will produce undesirable space-variant erasure and image decay characteristics.
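
As a rough numerical illustration of the dielectric relaxation time, defined in the paragraph above as the product of the volume resistivity, the relative dielectric permittivity, and the permittivity of free space, the sketch below converts a resistivity in Ω cm to SI units and evaluates the product. The sample values (a resistivity of 1e14 Ω cm and a relative permittivity of 56) are assumptions chosen only to show the order of magnitude; they are not measurements reported here, and the function name is hypothetical.

```python
# Permittivity of free space in F/m (CODATA value).
EPSILON_0 = 8.8541878128e-12


def dielectric_relaxation_time_s(rho_ohm_cm: float, eps_r: float) -> float:
    """Return tau = rho * eps_r * eps_0 in seconds.

    rho_ohm_cm : volume resistivity in ohm cm (converted to ohm m below)
    eps_r      : relative dielectric permittivity (dimensionless)
    """
    if rho_ohm_cm <= 0 or eps_r <= 0:
        raise ValueError("resistivity and permittivity must be positive")
    rho_ohm_m = rho_ohm_cm * 1e-2  # 1 ohm cm = 0.01 ohm m
    return rho_ohm_m * eps_r * EPSILON_0


if __name__ == "__main__":
    # Assumed, illustrative inputs; not values taken from this paper.
    tau = dielectric_relaxation_time_s(rho_ohm_cm=1e14, eps_r=56.0)
    print(f"tau ~ {tau:.0f} s (~ {tau / 60:.1f} min)")
```

Because the relaxation time is linear in the resistivity, each decade of resistivity changes the storage time by a decade, which is why an accurate resistivity measurement matters for predicting frame and grating storage times.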

Determination of the volume resistivity also has significant implications for materials
characterization. Just as in the case of semiconductor materials, the as-grown resistivity
provides an indication of both overall crystal purity and crystallographic perfection, as
well as potential for correlation with impurity and dopant analysis techniques. This
latter point is especially significant, since such techniques are at present not well
developed for low impurity/dopant levels in dielectric matrices. In fact, resistivity
variations can yield indirect information regarding the nature of states (shallow level,
deep level, recombination center) induced by specific selected incorporants. Finally,
the dark resistivity provides a critical baseline against which the photoconductive
sensitivity can be evaluated.

Traditional techniques for measuring resistivities
of order $10^{12}$ Ω cm and greater, such as direct
measurement of the current-voltage relationship,
use of the four-point probe [7], and van der
Pauw's method [8], are impractical due to the
resultant high voltages and extremely low currents
implied by the total sample resistance. Several
other difficulties bear on the measurement of very
high resistivities, including the surface conductivity
of the specimen crystal and crystal mount, the
possible influence of humidity-enhanced surface
conduction, and the dependence of the empirically-derived
dark resistivity on the optical and
thermal exposure history of the sample.

We  have  utilized  an  electrooptic  measurement 
technique  as  described  herein  that  largely  cir¬ 
cumvents  these  difficulties,  in  order  to  evaluate 
the  dark  volume  resistivities  of  both  undoped  and 
doped  samples  of  single  crystal  bismuth  silicon 
oxide.  The  technique  is  applicable  to  a  wide  range 
of  electrooptic  refractory  oxides,  such  as  bismuth 
germanium oxide (Bi12GeO20), bismuth titanium
oxide (Bi12TiO20), barium titanate (BaTiO3),
lithium niobate (LiNbO3), and strontium barium
niobate (Sr$_x$Ba$_{1-x}$Nb$_2$O$_6$).

The theoretical basis of the experimental technique
is presented in section 2, and the details of
the method are described in section 3. Results of
resistivity measurements on representative undoped
and doped single crystals of Bi12SiO20 are
given in section 4. Discussion and conclusions are
provided in section 5.

2.  Theoretical  considerations 

The  essence  of  the  measurement  technique  is 
the  formation  of  a  parallel  plate  capacitor  with  the 
specimen  electrooptic  single  crystal  acting  as  the 
dielectric,  by  depositing  semi-transparent  counter- 
electrodes  on  two  optically  polished,  parallel  faces 
of  the  sample.  The  electrooptic  effect  is  then 
utilized  to  provide  a  direct  measurement  of  the 
instantaneous  voltage  applied  across  the  sample. 
A  high  voltage  is  first  applied  to  the  sample  in 
order  to  charge  the  capacitor  to  an  initial  voltage 
V0,  and  is  then  disconnected  to  allow  the  resultant 
transient  voltage  decay  of  the  sample  capacitor  in 
parallel with its internal resistance to be monitored.
The time constant of the voltage decay
provides a direct measurement of the resistivity of
the sample independent of geometric factors, as
outlined below.

Consider a single crystal sample of Bi12SiO20
oriented along the [001] direction as shown schematically
in fig. 1. Bismuth silicon oxide is a cubic
(I23) non-centrosymmetric crystal, characterized
by a single value of the electrooptic coefficient
$r_{41} = r_{52} = r_{63}$. Hence, a voltage $V = E_{001} d$ applied
across the transparent conductive electrodes (in
which $d$ is the sample thickness) will induce principal
axes along the [110] and [$\bar{1}$10] directions,
with indices of refraction given by [9]:

$$n_{[110]} = n_0 - \tfrac{1}{2} n_0^3 r_{41} E_{001} \qquad (1)$$

and

$$n_{[\bar{1}10]} = n_0 + \tfrac{1}{2} n_0^3 r_{41} E_{001}. \qquad (2)$$

In eqs. (1) and (2) above, the natural optical
activity of bismuth silicon oxide is neglected (as
will be discussed further below). Consider further
propagation of an optical probe beam in the [001]
direction, polarized along the [010] direction. The
two resultant principal components will experience
a net phase shift given by:

$$\Gamma = \frac{2\pi}{\lambda}\, \Delta n\, d = \frac{2\pi}{\lambda}\, n_0^3 r_{41} V. \qquad (3)$$

Fig. 1. Typical electrooptic configuration for measurement of
the volume resistivity of bismuth silicon oxide (Bi12SiO20).
Both crystallographic and electric-field-induced principal axes
are shown for a sample oriented along the [001] direction.

Note that in this longitudinal electrooptic configuration,
the phase difference depends only on the
applied voltage, and not on the internal electric
field. For an analyzer oriented along the [100]
direction, the resultant transmitted intensity will
be:

$$I = I_0 \sin^2(\Gamma/2) = I_0 \sin^2\!\left(\frac{\pi n_0^3 r_{41} V}{\lambda}\right). \qquad (4)$$

This expression illustrates the one-to-one correspondence
between the transmitted intensity
through the system and the instantaneous voltage
across the crystal (provided that the applied voltage
is always less than the half wave voltage
$V_\pi = \lambda / (2 n_0^3 r_{41})$).
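
The one-to-one correspondence expressed by eq. (4) can be made concrete with a short numerical sketch. The function names below are hypothetical, and the material constants used in the example (n0 = 2.54 and r41 = 5 × 10^-12 m/V at the 632.8 nm probe wavelength) are illustrative assumptions rather than values taken from this paper. Optical activity and absorption are neglected, as in eqs. (1)-(4).

```python
import math


def half_wave_voltage(wavelength_m, n0, r41_m_per_v):
    """Half wave voltage V_pi = lambda / (2 n0^3 r41), from Gamma = pi."""
    return wavelength_m / (2.0 * n0 ** 3 * r41_m_per_v)


def transmitted_fraction(voltage, v_pi):
    """I/I0 = sin^2(Gamma/2), with Gamma = pi * V / V_pi (eq. 4)."""
    return math.sin(0.5 * math.pi * voltage / v_pi) ** 2


def voltage_from_fraction(fraction, v_pi):
    """Invert eq. (4); single valued only for 0 <= V <= V_pi."""
    if not 0.0 <= fraction <= 1.0:
        raise ValueError("fraction must lie between 0 and 1")
    return (2.0 * v_pi / math.pi) * math.asin(math.sqrt(fraction))


if __name__ == "__main__":
    # Assumed, illustrative constants (not from the paper).
    v_pi = half_wave_voltage(632.8e-9, 2.54, 5.0e-12)
    frac = transmitted_fraction(1600.0, v_pi)
    print(v_pi, frac, voltage_from_fraction(frac, v_pi))
```

The inversion is unambiguous only below the half wave voltage, which is the restriction stated in the text.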

The transient voltage decay of a resistor in
parallel with a capacitor is given by

$$V(t) = V_0 \exp(-t/\tau), \qquad (5)$$

in which $\tau$ is the characteristic RC time constant.
For a parallel plate capacitor of sufficiently high
aspect ratio to avoid significant field fringing effects,
the RC product is approximately equal to
the dielectric relaxation time $\tau = \rho\epsilon\epsilon_0$, which is thus
independent of geometric parameters. Since the
relative dielectric permittivity of Bi12SiO20 is
known ($\epsilon = 56$ [10]), measurement of the voltage
decay time constant directly yields the desired
volume resistivity.
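
As a numerical illustration of this argument, the sketch below checks that the RC product of an ideal parallel plate sample does not depend on electrode area or thickness, and converts a decay time constant into a resistivity. The function names and the sample dimensions are hypothetical; only the value of 56 for the relative permittivity comes from the text, and fringing is neglected.

```python
EPS0_F_PER_CM = 8.854e-14  # permittivity of free space, in F/cm


def rc_time_constant(rho_ohm_cm, eps_r, area_cm2, thickness_cm):
    """RC product of an ideal parallel plate sample (fringing neglected)."""
    resistance = rho_ohm_cm * thickness_cm / area_cm2              # R = rho d / A
    capacitance = eps_r * EPS0_F_PER_CM * area_cm2 / thickness_cm  # C = eps eps0 A / d
    return resistance * capacitance                                # = rho eps eps0


def resistivity_from_decay(tau_s, eps_r=56.0):
    """Volume resistivity (ohm cm) from a decay time constant in seconds."""
    return tau_s / (eps_r * EPS0_F_PER_CM)


if __name__ == "__main__":
    # Two hypothetical geometries give the same time constant.
    print(rc_time_constant(1.0e15, 56.0, 1.0, 0.1))
    print(rc_time_constant(1.0e15, 56.0, 0.25, 0.3))
    # A 40 min time constant corresponds to a few times 10^14 ohm cm.
    print(resistivity_from_decay(40 * 60.0))
```

For a decay that is not purely exponential, a single time constant yields only an order-of-magnitude estimate.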

Two  issues  are  particularly  worthy  of  note  at 
this  point  in  the  discussion.  First  and  foremost, 
the  measurement  technique  utilized  herein  (as  do 
all  resistivity  measurement  techniques)  assumes  a 
linear,  or  at  least  piecewise  linear,  current-voltage 
relationship. As will be shown in section 4, bismuth
silicon oxide samples exhibit significant nonlinear
current-voltage behavior in the high field
regime ($\geq 10^{3}$ V/cm). Hence the measurement
described herein is more accurately a measure of
the differential volume (dark) resistivity at the
specified initial value of applied voltage $V_0$. We
have chosen a value of $V_0$ (1600 V) which is of
particular relevance to the device applications discussed
in section 1. The implications of this point
will be discussed further in section 4.

The second issue concerns the absolute orientation
of the single crystal sample, the effects of
finite absorption coefficients, and the natural optical
activity of Bi12SiO20. As described in section 3,
an essentially arbitrarily oriented sample with any
combination of linear and nonlinear optical properties
can be easily accommodated by the simple
device of obtaining a calibration curve of transmitted
intensity as a function of voltage during
the initial charging of the capacitor. The only
requirement is that a measurable fraction of the
transmitted light experience an electrooptic (or
electrorefractive) effect such that a monotonic
calibration curve can be established. This further
eliminates any experimental dependence on the
measurement wavelength, the electrooptic coefficient,
the index of refraction, the crystal thickness,
the angle of incidence, and the optical rotatory
power.

Finally, a similar resistivity measurement technique
has been described previously [11] in which
the sample to be measured is connected in parallel
with an electrooptic crystal, which in turn is
utilized to measure the voltage decay. This technique
is applicable for the measurement of resistivities
small compared with that of the calibrated
sample.

3.  Experimental  procedure 

The basic configuration utilized for the measurement
of high volume resistivities in electrooptic
crystals is as shown in fig. 2. The sample to
be measured is typically oriented as shown in fig.
1 and mounted in an environmental chamber to
allow control of the ambient atmosphere. The
chamber is flushed with filtered ultra high purity
dry N2 gas prior to and during the measurement
procedure in order to minimize adsorption of
surface moisture and contaminants. Incident probe
light derived from a polarized helium neon laser
(chosen to minimize the photoconductive decay of
the bismuth silicon oxide samples) is spatially
filtered, collimated, apertured, and passed through
a final polarizer before transmittal through the
sample at near normal incidence and a polarization
analyzer. Light passing the analyzer is detected
by a silicon photodetector, which provides
the signal channel input to a phase sensitive detector
(Princeton Applied Research Model 126 Lock-In
Amplifier). The reference output of the lock-in
is used to drive an optical chopper through a
controller to complete a phase-locked loop. This
allows operation at very low signal amplitudes
(incident intensities), again minimizing photoconductive
decay of the sample voltage. The average
incident power during the course of the measurement
is further reduced by utilization of neutral
density filters and a computer controlled optical
shutter that remains closed at all times except
during brief measurement intervals.

Fig. 2. The experimental configuration for measurement of
volume resistivity in electronic materials, as described in
detail in section 3.

Voltage  is  applied  to  the  crystal  by  means  of  a 
programmable  high  voltage  power  supply  through 
a  single  pole,  single  throw  knife  switch.  Both  a 
mercury  relay  and  a  high  voltage  contactor  were 
employed  in  trial  runs  in  the  hopes  of  implement¬ 
ing  full  computer  control  of  the  voltage  disconnect 
sequence,  but  proved  to  have  intolerably  high 
parasitic  capacitances.  The  knife  switch  was  auto¬ 
mated  for  computer  control  by  incorporation  of  a 
long  throw  solenoid  interfaced  to  the  microcom¬ 
puter. 

In operation, the microcomputer increases the
voltage across the sample in increments, recording
the transmitted intensity at each incremental voltage.
This produces the requisite calibration curve
for direct interpretation of the voltage transient
decay, as described in section 2. Examples of such
calibration curves are shown in fig. 3 for the four
samples employed in this study. Following completion
of the calibration sequence, the optical
shutter is automatically closed, and the knife
switch automatically opened by means of the electromechanical
solenoid. The microcomputer controls
subsequent data acquisition during the decay
transient, controlling the shutter, recording the
transmitted intensity, and recording the instantaneous
laser power by means of a reference photodiode
for purposes of normalization. Voltage decay
measurements were typically recorded every
15 min, with five independent measurements of
the intensity at a given time averaged to form each
plotted data point. Dependent on the decay time
constant to be measured, voltage decay runs varied
in duration from 15 min to over one hundred
hours.
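
The data reduction implied by this procedure can be sketched as follows. This is a minimal illustration and not the authors' acquisition software: the function names are hypothetical, the calibration curve is assumed to be strictly increasing over the voltage range of interest, and linear interpolation between calibration points is an assumed choice.

```python
from bisect import bisect_left


def normalized_reading(signal_samples, reference_samples):
    """Average of signal/reference ratios (e.g. five readings per data point)."""
    ratios = [s / r for s, r in zip(signal_samples, reference_samples)]
    return sum(ratios) / len(ratios)


def intensity_to_voltage(intensity, cal_intensities, cal_voltages):
    """Map a normalized intensity to a voltage by linear interpolation
    on a strictly increasing calibration curve."""
    pairs = zip(cal_intensities, cal_intensities[1:])
    if any(b <= a for a, b in pairs):
        raise ValueError("calibration curve must be strictly increasing")
    if not cal_intensities[0] <= intensity <= cal_intensities[-1]:
        raise ValueError("intensity outside the calibrated range")
    i = bisect_left(cal_intensities, intensity)
    if i == 0:
        return cal_voltages[0]
    i0, i1 = cal_intensities[i - 1], cal_intensities[i]
    v0, v1 = cal_voltages[i - 1], cal_voltages[i]
    return v0 + (v1 - v0) * (intensity - i0) / (i1 - i0)


if __name__ == "__main__":
    # Hypothetical calibration points recorded during charging.
    cal_i = [0.0, 0.1, 0.4, 0.9]
    cal_v = [0.0, 400.0, 800.0, 1600.0]
    reading = normalized_reading([0.50, 0.55], [2.0, 2.2])
    print(intensity_to_voltage(reading, cal_i, cal_v))
```

Dividing each reading by the reference photodiode signal before interpolation removes laser power drift from the inferred voltage.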

A typical series of voltage decay curves is
plotted in fig. 4 for one of the undoped Bi12SiO20
samples. Four curves are shown, for average power
levels during measurement of 2 μW, 200 nW, 20
nW, and 20 pW. The first three curves show the
effects of enhanced photoconductive decay, while
the 20 pW decay curve showed little sensitivity to
continued decreases in average power level, and
hence can be considered representative of the dark
resistivity.

Fig. 3. Electrooptic calibration curves for the four samples
utilized in this study. The quantity T is the intensity
transmission coefficient of each sample at the probe wavelength,
uncorrected for reflection losses.

Fig. 4. Characteristic voltage decay curves for undoped sample
BSO-M, showing the dependence of the decay time constant
on the average incident power level over the time span of the
measurement.

It should be noted that bismuth silicon oxide
exhibits the effects of multiple trap levels [10], and
as such the occupancy state of these traps prior to
measurement will affect the obtained values of the
dark resistivity. It has further been observed, for
example, that pre-illumination with IR can significantly
alter the trap occupancy and dark conductivity
[12]. In order to standardize the initial
conditions in a manner representative of the operating
conditions  of  most  spatial  light  modulators  and 
volume  holographic  optical  elements,  the  samples 
were  pre-illuminated  with  intense  broad  band 
radiation  while  the  counterelectrodes  were  shorted 
together.  This  serves  the  further  purpose  of  uni¬ 
formly  distributing  any  accumulated  space  charge 
prior  to  measurement. 

Careful sample preparation was found to be
critical in obtaining consistently repeatable results.
Single crystal samples were oriented by Laue
back reflection X-ray diffraction techniques, cut,
and polished to better than λ/4. The samples
were then ultrasonically cleaned in semiconductor
grade TCE, acetone, and methanol, followed by a
thorough rinse in 18 MΩ cm deionized water.
Indium tin oxide (ITO) transparent counterelectrodes
were deposited by RF magnetron sputtering
through pattern-defining masks. Contact to the
samples was made through opposing conductive
o-ring gaskets, and in some cases by small wires
attached to the ITO electrodes with silver paint.
Between measurements, samples were kept in a
desiccated environment to avoid adsorbed moisture.


4.  Resistivity  measurements 

Four single crystal samples of Bi12SiO20 were
chosen for measurement, representative of both
undoped and doped crystals grown from the melt
by the Czochralski technique [9]. The four crystal
samples and their characteristics are listed in table
1. The first two samples, BSO-28 and BSO-51, are
representative of undoped single crystals grown
from different starting batches of five 9's purity
Bi2O3 and SiO2 under similar but not identical
growth conditions. Sample BSO-53 was grown
from the same starting melt as BSO-51, with the
addition of 5 ppm (atomic) iron (from iron oxide)
to the melt. Sample BSO-46 was intentionally
doped with 180 ppm (atomic) chromium in the
melt (from chromium oxide). Using reported values
of the segregation coefficients in Bi12SiO20
(Fe: < 0.05; Cr: 1.8 [13]), this implies crystalline
doping densities of approximately 250 ppb or less
Fe in BSO-53 and 320 ppm Cr in BSO-46. The
optical quality of the samples was determined by
careful inspection for strain induced birefringence
using polarization microscopy. No growth induced
strain was evident in any of the samples.

Table 1. Sample characteristics

Sample    Description
BSO-28    Undoped               Nominally pure
BSO-51    Undoped               Nominally pure
BSO-53    Fe doped (5 ppm)      Very low level dopant
BSO-46    Cr doped (180 ppm)    Intermediate level dopant
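
The crystalline doping densities quoted above follow from the melt concentrations and the reported segregation coefficients. The short worked calculation below assumes the simple proportionality $c_{\mathrm{crystal}} \approx k\, c_{\mathrm{melt}}$. This is a simplifying assumption introduced here for illustration; it neglects any change in melt composition during growth.

```latex
% Assumption: c_crystal ~ k * c_melt, with k the reported segregation coefficient
\begin{align*}
c_{\mathrm{crystal}} &\approx k \, c_{\mathrm{melt}} \\
\text{Fe (BSO-53):}\quad c_{\mathrm{crystal}} &< 0.05 \times 5\ \text{ppm}
    = 0.25\ \text{ppm} = 250\ \text{ppb} \\
\text{Cr (BSO-46):}\quad c_{\mathrm{crystal}} &\approx 1.8 \times 180\ \text{ppm}
    = 324\ \text{ppm} \approx 320\ \text{ppm}
\end{align*}
```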

Characteristic decay curves for each of the four
samples are shown in fig. 5. Sample BSO-28 exhibited
the shortest time constant to the 1/e point
(approximately 40 min), and Cr-doped BSO-46
exhibited the longest (in excess of 100 h). As
discussed in section 2, note that none of the
voltage decay curves shown in fig. 5 can be adequately
fit by a straight line as would be expected
for a purely exponential decay. This is a clear
indication of an inherent nonlinearity in the
current-voltage characteristic for this voltage range.
However, the calculated variation in differential
resistivity along each of the curves amounts to
a factor of order two.

Resistivity values for each of the four representative
samples are plotted for comparison in
fig. 6 as a function of the average incident power
level over the time span of the measurement.
These values were calculated from the first two
reliable data points following initiation of the
voltage decay measurement in each case, and as
such closely approximate the differential resistivity
at or near 1600 V for each sample. Note that
each curve is seen to saturate with decreasing
average incident power, indicating a valid assignment
of the dark resistivity at the lower limiting
power level. Furthermore, note the variation in
slopes for higher incident power levels, which implies
a corresponding variation in sample photoconductivity
at the probe wavelength of 6328 Å.

The lowest resistivity sample measured is BSO-28,
with a value near $4 \times 10^{14}$ Ω cm. The second
undoped sample (BSO-51) is considerably more
resistive, at $1.5 \times 10^{15}$ Ω cm. This variation can be
attributed either to differences in starting batch
impurity levels or to distinct growth conditions.
The addition of only 5 ppm Fe to the melt is seen
to lower the resistivity from that of BSO-51 to that
of BSO-53 ($9 \times 10^{14}$ Ω cm), while the addition of
180 ppm Cr to the melt produced a sample with
the highest resistivity observed among our samples
to date (BSO-46: $1.9 \times 10^{16}$ Ω cm).

Fig. 5. Characteristic voltage decay curves for all four Bi12SiO20
samples, all at an average incident power level of 20 pW over
the time span of the measurement.

Fig. 6. Measured values of the volume resistivity for all four
Bi12SiO20 samples as a function of average incident power
over the time span of the measurement.
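
A calculation of the differential resistivity from two successive points of a decay curve can be sketched as below. The function name is hypothetical, and treating the decay between the two points as locally exponential is an assumption made here for illustration; the paper does not give the explicit formula used.

```python
import math

EPS0_F_PER_CM = 8.854e-14  # permittivity of free space, in F/cm


def differential_resistivity(t1_s, v1, t2_s, v2, eps_r=56.0):
    """Resistivity (ohm cm) from two points of a voltage decay curve,
    assuming a locally exponential decay between them."""
    if not (t2_s > t1_s and 0.0 < v2 < v1):
        raise ValueError("need t2 > t1 and 0 < v2 < v1")
    tau = (t2_s - t1_s) / math.log(v1 / v2)  # local time constant
    return tau / (eps_r * EPS0_F_PER_CM)     # rho = tau / (eps eps0)


if __name__ == "__main__":
    # Hypothetical readings taken 15 min apart, starting at 1600 V.
    print(differential_resistivity(0.0, 1600.0, 900.0, 1450.0))
```

Applying the same function to later pairs of points along a curve gives the variation of the differential resistivity with voltage.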


It should be noted that the Cr doped sample
was quite unusual in that it proved to be quite
insensitive to considerable levels of illumination.
For example, with 1600 V applied, minutes of
exposure to bright, broad band illumination caused
negligible voltage decay, as did continual exposure
to nominal ambient room illumination levels. The
combination of an increased resistivity with a
significantly diminished photosensitivity suggests
that Cr may form an active recombination center
with a large capture cross section in bismuth silicon
oxide. Other dopants in Bi12SiO20 have also
been observed to both decrease the photoconductivity
in the visible spectrum and change the
volume resistivity [14].

5.  Discussion  and  conclusions 

Several  device  implications  accrue  to  the  elec¬ 
trooptic  measurement  of  the  volume  resistivity  of 


bismuth  silicon  oxide,  particularly  with  regard  to 
its  potential  correlation  with  growth  and  doping 
conditions.  The  attainment  of  very  large  resistivity 
values  in  certain  electro-optic  single  crystals 
through  either  growth  modification  or  dopant  in¬ 
corporation  is  of  considerable  importance,  as  such 
values  imply  very  large  charge  storage  (frame) 
times  in  most  device  configurations.  In  addition, 
the  achievement  of  long  dielectric  relaxation  times 
implies  the  added  flexibility  of  decay-free  tem¬ 
poral  integration  processing  modes  in  both  opti¬ 
cally  addressed  electrooptic  spatial  light  modula¬ 
tors  and  photorefractive  volume  holographic  opti¬ 
cal  elements.  In  some  cases,  device  structures 
utilize  dielectric  blocking  layers  to  prevent  longi¬ 
tudinal  charge  transport  through  the  device.  An 
important  example  is  the  Pockels  Readout  Optical 
Modulator,  which  employs  vapor  deposited  pary- 
lene  organic  thin  films  as  high  resistivity,  high 
dielectric  breakdown  strength  blocking  layers.  In 
such cases, pre-determination of the volume resistivity
allows the effects of sub-optimum dielectric
blocking layers to be accurately assessed [15-17].

Effects  of  growth  modification  and/or  dopant 
incorporation  on  the  photoconductive  response  at 
particular  wavelengths  can  also  have  considerable 
impact  for  potential  enhancements  of  device  per¬ 
formance.  For  example,  increased  photoconductiv¬ 
ity  at  specified  (writing)  wavelengths  optimizes  the 
exposure  sensitivity  of  optically  addressed  spatial 
light  modulators.  On  the  other  hand,  decreased 
photoconductivity  at  specified  (readout)  wave¬ 
lengths  reduces  the  potential  for  optical  damage  in 
single and multiple channel electrooptic modulators,
and can also increase the available readout
gain for several classes of spatial light modulators.

We  have  presented  herein  a  technique  for  accu¬ 
rate  and  repeatable  determination  of  the  volume 
resistivity  (and  photoconductivity)  of  electrooptic 
single  crystals.  The  technique  is  quite  straightfor¬ 
ward  to  implement,  is  nondestructive,  and  allows 
for  in  situ  variation  of  parameters  such  as  external 
illumination,  ambient  atmosphere,  and  ambient 
temperature.  In  addition,  the  technique  does  not 
require  accurate  crystallographic  orientation  or 
sample alignment for successful implementation.

Polarization Properties of Enhanced Self-Diffraction
in Sillenite Crystals

ABDELLATIF MARRAKCHI, RICHARD V. JOHNSON, MEMBER, IEEE, AND
ARMAND R. TANGUAY, JR., MEMBER, IEEE


Abstract—Doppler-enhanced self-diffraction in two-beam coupling
experiments with sillenite crystals exhibits pronounced optical
polarization effects due to the concomitant presence of natural optical
activity and electric-field-induced linear birefringence. Coupled wave
equations that describe the polarization properties, exclusive of
self-diffraction effects, have previously been derived for the two principal
crystal orientations most commonly used in photorefractive recording.
In this paper, the coupled wave equations are combined with a linearized
model of the photorefractive recording process (single trap level,
single mobile charge species) to analyze the impact of self-diffraction
effects on the polarization state evolution. Numerical solutions of these
equations yield optimum configurations for enhanced gain and improved
image contrast in two-wave mixing with such materials. In addition,
inclusion of optical activity in the model emphasizes the contribution
of this effect to the apparent reduction of the effective
electrooptic coefficient of bismuth silicon oxide crystals.

I.  Introduction 

A MOST interesting class of photorefractive media is
that of the sillenite crystals, which includes bismuth
silicon oxide (Bi12SiO20, or BSO), bismuth germanium
oxide (Bi12GeO20, or BGO), and bismuth titanium oxide
(Bi12TiO20, or BTO) [1]. These crystals exhibit high
photorefractive sensitivity for volume holographic grating
formation [2], fast response, long storage times under dark
conditions, and essentially unlimited recyclability. Such
crystals are potentially useful for dynamic real-time and
time average interferometry [3], nonlinear optical signal
processing [4], [5], phase-corrected image propagation
through aberrating media [6], optical interconnections [7],
spatial optical switching [8], self-pumped laser resonators
[9], and image amplification [10].

Light diffraction from volume holograms in sillenite
crystals exhibits pronounced polarization properties,
encompassing the effects of both natural optical activity and
electric-field-induced linear birefringence. A detailed
model of these polarization properties is critical for
obtaining maximum performance in photorefractive
applications. For example, Herriau et al. [11], [12] have
demonstrated a technique for significantly improving the
holographic image quality by suppressing spurious scattered
light. This technique relies on the fact that each of
the two holographic writing beams generally evolves into
a distinctly different polarization state, so that the two
beams can be distinguished by a polarization analyzer.

Numerous studies of the polarization properties have
been published for each of the two principal crystallographic
orientations (Figs. 1 and 2) in which the presence
of optical activity is neglected [13]-[15]. The effects of
optical activity have been incorporated by coupled wave
equation techniques, as derived by the authors [16], and
independently by Vachss and Hesselink [17], which describe
transmissive holographic recording geometries in
the absence of self-diffraction effects. Mallick et al. have
derived analytic solutions for these coupled wave equations
in the limit in which one of the two recording beams
remains undepleted as it propagates, and in the limit of
perfect Bragg matching [18]. Kukhtarev et al. have derived
a set of coupled wave equations which includes both
optical activity and self-diffraction effects, and have observed
that self-diffraction can significantly modify the
polarization states exhibited by the light beams [19]. The
detailed analysis presented in [19] emphasized a reflection
holographic recording geometry, but the coupled wave
equations can easily be recast for a transmissive geometry.
The scope of the present paper is to explore the impact
of self-diffraction effects on the polarization properties
of volume holograms in sillenite crystals recorded in
a transmissive geometry (as shown schematically in Fig.
3). Operation deep within the Bragg regime is assumed
throughout the following analysis.

Self-diffraction refers to the process whereby the two
writing laser beams, which interfere to form the photorefractive
grating, diffract from the forming grating,
thereby modifying the interference fringe profile deeper
within the crystal [20]-[22]. This effect modifies both the
modulation depth of the grating and the phase of the fringe
system. The result is a temporal and spatial evolution of
the grating strength and phase, which eventually stabilizes
in the steady-state limit to a spatially-varying grating
strength and phase. Representative steady-state grating
profiles induced by self-diffraction are shown in Figs. 4
and 5, based upon the model described in the next section.

Fig. 1. The {$K_G \parallel \langle 001 \rangle$} crystal orientation of bismuth silicon oxide
(Bi12SiO20, or BSO) for volume holography. The dashed line represents
the normal to the entrance face. The x, y, and z coordinate system shown
to the left of the crystal is assumed in the coupled wave analysis. The
$X_{eo}$, $Y_{eo}$, and $Z_{eo}$ axes shown on the crystal face refer to the principal
electrooptic axes induced by the applied bias electric field.

Fig. 2. The {$K_G \perp \langle 001 \rangle$} crystal orientation of Bi12SiO20 for volume
holography.

Fig. 3. Optical arrangement for Doppler-enhanced two-beam coupling, in
which the temporal frequency of one of the two writing laser beams is
slightly shifted by reflection from a piezoelectrically driven mirror,
creating a moving grating within the photorefractive medium.

Note the curious structural features of the grating profiles,
especially for the {$K_G \perp \langle 001 \rangle$} orientation (defined in
Fig. 2). In one case, the grating strength stays essentially
uniform for the conditions considered, but the grating
phase fronts are corrugated rather than flat. In another
case, the grating phase fronts remain flat but the grating
strength exhibits a layered structure.

One manifestation of self-diffraction effects can be energy
coupling between the two recording laser beams, in
which the intensity of one of the beams can be amplified
at the expense of the intensity in the second beam. This
phenomenon has been extensively investigated in the literature
[10], [23]-[25]. Maximum energy coupling occurs
when a 90° phase shift exists between the optical
interference pattern and the resulting space charge field.
This optimum phase shift occurs naturally when recording
stationary interference patterns in the diffusion regime, but
typical space charge fields and hence energy coupling effects
are low in this regime. The photosensitivity can be
significantly enhanced by applying a bias electric field (as
shown in Figs. 1-3), but the phase relationship between
a stationary light interference pattern and the resulting
space charge field is generally not optimum for energy
coupling.

Fig. 4. Self-diffraction induced growth of the space charge field
modulation depth as a function of crystal depth for various recording
configurations. A bias field of 6 kV/cm and the material parameters for BSO
given in Table I are assumed.

Fig. 5. Variation of the space charge field phase with depth in the crystal
resulting from self-diffraction effects, for the same recording conditions
assumed in the previous figure. The phase angle plotted here is relative
to the phase angle at the entrance face of the crystal.

At least two techniques have been demonstrated for enhancing
the self-diffraction process for energy coupling
applications. One technique is to Doppler shift one of the
recording laser beams, for example by reflecting this beam
from a piezoelectrically-driven constant velocity mirror,
as shown in Fig. 3 [10], [23]-[25]. A Doppler shift causes
the light interference pattern to translate with speed $v$,
which modifies the phase between the optical interference
and space charge field profiles. One particular grating
speed $v_{\mathrm{opt}}$ exists which ensures the 90° phase shift needed
for maximum energy coupling. An additional benefit of
the Doppler technique is a significant increase in the magnitude
of the space charge field, potentially by as much
as an order of magnitude (see Fig. 6, which is described
more fully in Section II-B). A second technique for enhancing
energy coupling is to record a stationary interference
pattern with a rapidly reversing applied bias field
[25].

Self-diffraction effects can also be manifested as an alteration of the
polarization states of the two writing beams, as shown in Figs. 7-18.
These figures compare representative evolutions of polarization angle
and ellipticity for the undepleted pump beam and for the signal beams
associated with three alternative recording configurations. One
recording configuration involves no self-diffraction, but rather
presumes a photorefractive grating which is uniform in both amplitude
and phase throughout the volume of the hologram. This is the model
assumed in our previous polarization properties analysis [16], as well
as those of Vachss and Hesselink [17], and Mallick et al. [18]. The
second recording configuration incorporates self-diffraction effects
associated with a stationary interference pattern, and the third
configuration incorporates self-diffraction effects associated with a
moving interference pattern which can enhance energy coupling in
sillenite crystals [10], [23]-[25]. These numerical solutions are
reviewed more fully in Section III-A.

Figs. 7-18 demonstrate graphically that self-diffraction effects have a
profound impact on the polarization properties of light diffraction from
volume holograms in sillenite crystals, especially when techniques such
as Doppler-shifting are used to enhance the self-diffraction. A model of
light diffraction in sillenite crystals which incorporates
self-diffraction effects is detailed in Section II, and the sample
solutions are studied in detail in Section III-A. To understand the key
implications of the numerical solutions, fundamental features of the
diffraction anisotropy are reviewed in Section III-B. Finally,
conclusions are drawn from the analysis in Section IV.

II. Description of the Model and Numerical Method

The complete model of light diffraction in the presence of
self-diffraction effects synthesizes two principal components. One
component specifies the light diffraction, coupling, and polarization
evolution in an electrooptic medium in response to a prescribed
quasi-static electric field pattern E(x, z). The second component
specifies the photorefractive recording of a space charge field
E_sc(x, z) in response to a given optical intensity profile formed by
the coherent interference of two intersecting laser beams. Each
component is explored separately below, and the numerical algorithm for
combining them is then described.


A. Coupled Wave Equations for Light Diffraction

The diffraction of light, energy coupling, and evolution of the
polarization states for two intersecting laser beams in a volume
hologram are all governed by a set of coupled wave equations, one set
for each of the two principal crystallographic orientations. These two
sets of equations have been derived and discussed in detail previously
[16] and are therefore not reproduced herein. The parameters appearing
in these equations include natural circular birefringence and
electric-field-induced linear birefringence. The electric field profile
E(x, z) in steady state is assumed to have the form

\[ E(x, z) = E_0 + \tfrac{1}{2}\left[ E_{sc}(z)\exp(i K_G x) + \text{complex conjugate} \right] \tag{1} \]

in which E_0 is a bias electric field applied to the crystal to enhance
the photosensitivity (shown in Figs. 1-3), and E_sc(z) is the space
charge field induced by the photorefractive effect. Higher order spatial
harmonics of the space charge field may exist, but are neglected in this
analysis because these harmonics are not Bragg-matched to the incident
light beams.

Given a particular form for the space charge field E_sc(z), the coupled
wave equations can be readily integrated to give the light intensities
and polarization states at the exit window of the volume hologram. In
our previous studies, the space charge field E_sc was presumed to be
uniform in amplitude and phase throughout the hologram. However,
self-diffraction effects are known to induce spatial variations in the
amplitude and/or phase, as shown in Figs. 4 and 5, and so we must study
the effects of self-diffraction on the photorefractive recording
process, considered next.

B. Photorefractive Recording Model

In the photorefractive recording process, a space charge field E_sc(z)
is induced in response to an optical interference pattern formed between
two writing beams with amplitudes R and S (see Fig. 3). A useful
parameter for analyzing the photorefractive recording process is the
modulation depth m of the optical interference pattern, which is related
to the light amplitudes R and S by

\[ m = \frac{2\, R^{*} \cdot S}{|R|^{2} + |S|^{2}} \tag{2} \]

in  which  the  asterisk  denotes  complex  conjugation,  and 
the  dot  denotes  an  inner  product.  The  modulation  depth 
m  as  defined  above  is  a  complex  number  with  phase  de¬ 
termined  by  the  relative  phase  of  the  optical  interference 
pattern  with  respect  to  the  coordinate  system. 
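
As a hedged illustration of (2), the following Python sketch evaluates the modulation depth for two beams represented as two-component complex Jones vectors. The function name `modulation_depth`, the tuple representation, and the sample amplitudes are assumptions made for this example; they are not taken from the original analysis.

```python
def modulation_depth(R, S):
    """Complex modulation depth m = 2 R* . S / (|R|^2 + |S|^2).

    R and S are hypothetical two-component Jones vectors (tuples of
    complex numbers) for the pump and signal beams, respectively.
    """
    inner = sum(r.conjugate() * s for r, s in zip(R, S))  # inner product R* . S
    total = sum(abs(r) ** 2 for r in R) + sum(abs(s) ** 2 for s in S)
    return 2 * inner / total


pump = (1.0 + 0j, 0j)
weak_parallel = (0.01 + 0j, 0j)      # same polarization as the pump
weak_orthogonal = (0j, 0.01 + 0j)    # polarization orthogonal to the pump

m_par = modulation_depth(pump, weak_parallel)
m_orth = modulation_depth(pump, weak_orthogonal)
```

For these sample values the identically polarized weak signal gives |m| close to 0.02, well inside the linear regime, while the orthogonally polarized signal gives m = 0 because the inner product vanishes.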

A linearized recording model applies whenever the intensity of one of
the recording beams is significantly smaller than that of the second
beam, such that |m| ≪ 1. The stronger beam is generally called the pump
beam, with amplitude R, and the weaker beam is called the signal beam,
with amplitude S. In the linear recording and [...] at least to first
order in m. The electric field factor remains constant throughout the
volume of the hologram, whereas the modulation depth m varies slowly
with depth coordinate z due to energy coupling and also due to the
nonidentical polarization state evolution of the two writing laser
beams.

Detailed expressions for the field term in the linear approximation have
been published by Kukhtarev et al. for recording stationary gratings
using a single mobile charge species and single trap species model [22],
and alternatively using two types of photoexcited carriers and two trap
species in the charge transport model [19]. In the context of the
present analysis, the only effect induced by the additional charge and
trap species is to reduce the coupling between the two light beams, and
perhaps to modify its phase. For simplicity, we restrict our attention
to a single mobile species, single trap species model for the remainder
of this article.

Analytic expressions for the field factor for Doppler-enhanced
recording, assuming a single mobile charge species and a single trap
site, have been derived by Valley [25] and Refregier et al. [10]. Fig. 6
shows typical magnitudes of the field factor for Doppler-enhanced
recording at optimum speed to maximize the energy coupling, based upon
the published analytic solutions, and assuming the material parameters
listed in Table I. The grating speed has been reoptimized for each
combination of applied bias field and grating spatial frequency.

C.  Numerical  Algorithm 

A typical numerical analysis of self-diffraction effects proceeds as
follows. First, one specifies two incident light beams, including their
intensities and polarization states, at the entrance window z = 0 of the
photorefractive crystal. From this, one can determine the modulation
depth m(z = 0) of the intensity profile, and hence the space charge
field E_sc(z = 0) and the birefringent coupling factor (discussed
previously in [16]). Next, the coupled wave equations can be integrated
directly to determine the light amplitudes and polarization states at a
deeper level Δz within the crystal. This defines the space charge field
recorded at that depth through the modulation index m. The process is
then repeated as needed until the exit window of the photorefractive
crystal is reached. The numerical integration can be performed either by
a Runge-Kutta algorithm [27], [28] or by the anisotropic optical beam
propagation method [29]. It is interesting to note that the computation
time with self-diffraction effects included is only minimally longer
than the time for the original set of coupled wave equations, at least
for a linear model of the photorefractive recording process in steady
state. Also, it must be emphasized that the numerical solutions
presented in this article apply only to linear recording (|m| ≪ 1) in
the steady-state regime. This concludes the description of the model and
the corresponding numerical methods.

Fig. 6. Magnitude of the space charge field factor as a function of
grating spatial frequency for varying values of the applied bias field,
as achieved by the Doppler-enhanced recording technique. The grating
velocity has been reoptimized for each point on the curves (see also
[25] for similar curves).

TABLE I
Parameters Assumed in the Calculations

Parameter                              Value                    Units
refractive index (n)                   [illegible]
absorption (α)                         [illegible]              cm^-1
optical rotatory power (ρ)             38.6                     deg/mm
electrooptic coefficient (r_41)        [illegible]              [illegible]
intensity ratio (I_s0/I_p0)            1.0 × 10^[illegible]     dimensionless
trap density (N_A)                     1.0 × 10^[illegible]     cm^-3
dielectric constant (ε)                56                       dimensionless
electron mobility (μ)                  0.03                     cm^2/V·s
electron recombination rate (γ_R)      [illegible]              cm^3/s
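
The marching procedure of Section II-C can be sketched in Python as shown below. This is only a scalar toy model under stated assumptions: the vectorial coupled wave equations of [16] are not reproduced in this article, so the single equation dS/dz = g m(z) R / 2 is a hypothetical stand-in, the lumped complex constant `g` (representing the field factor times the electrooptic coupling) and all numerical values are invented for illustration, and the pump amplitude R is held fixed (undepleted pump). What the sketch retains is the structure of the algorithm: the modulation depth is recomputed from the local beam amplitudes at every depth, and the wave equation is then advanced by one Runge-Kutta step.

```python
def modulation_depth(R, S):
    # Scalar form of the modulation depth, m = 2 R* S / (|R|^2 + |S|^2).
    return 2 * R.conjugate() * S / (abs(R) ** 2 + abs(S) ** 2)


def signal_slope(R, S, g):
    # Toy stand-in for the coupled wave equations (an assumption): the
    # space charge field is taken as proportional to the local m, and it
    # scatters pump light into the signal beam.
    return 0.5 * g * modulation_depth(R, S) * R


def march(R, S0, g, thickness, steps):
    """Advance the signal from z = 0 to z = thickness with classical RK4.

    Returns a list of (z, S, m) samples; the pump R is undepleted.
    """
    dz = thickness / steps
    S = S0
    samples = [(0.0, S, modulation_depth(R, S))]
    for i in range(steps):
        k1 = signal_slope(R, S, g)
        k2 = signal_slope(R, S + 0.5 * dz * k1, g)
        k3 = signal_slope(R, S + 0.5 * dz * k2, g)
        k4 = signal_slope(R, S + dz * k3, g)
        S = S + dz * (k1 + 2 * k2 + 2 * k3 + k4) / 6
        samples.append(((i + 1) * dz, S, modulation_depth(R, S)))
    return samples


# Hypothetical numbers: unit pump, weak signal, gain constant of 1 per mm.
profile = march(R=1 + 0j, S0=1e-3 + 0j, g=1 + 0j, thickness=1.0, steps=100)
z_exit, S_exit, m_exit = profile[-1]
```

In the linear regime (|m| ≪ 1) this toy model reduces to dS/dz ≈ g S, so the exit amplitude should lie close to S0 exp(g d); comparing against that limit is a convenient sanity check before substituting a more realistic set of equations.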

III. Discussion of Numerical Solutions

Having defined the model, let us now turn to a study of representative
solutions. Numerical solutions given in Figs. 7-16 are explored in
Section III-A. To understand aspects of these solutions, the diffraction
anisotropies inherent in sillenite crystals in the absence of optical
activity are reviewed in Section III-B. Finally, a heuristic argument is
proposed in Section III-C to explain some of the more striking aspects
of the numerical solutions.

A.  Numerical  Solutions 

Diffraction and polarization properties of a number of recording
configurations have been studied numerically, with results as shown in
Figs. 7-16. Configuration parameters include the crystal orientation,
the applied bias field, the grating spatial frequency, the assumption of
either a stationary or a Doppler-shifted optical interference pattern,
and the incorporation of either a self-diffraction or a uniform grating
recording model. Two alternative combinations of bias field and grating
spatial frequency have been considered as representative of different
solution regimes. One combination consists of a bias electric field of
6 kV/cm and a grating spatial frequency of 300 cycles/mm. This bias
field is high enough to enhance significantly the photosensitivity, and
the grating spatial frequency is high enough to allow for excellent
resolution in any reconstructed volume hologram. In the second
combination, the bias electric field is increased to 10 kV/cm, which is
about the upper limit for BSO in practical applications. The grating
spatial frequency has been reduced from 300 to 70 cycles/mm, even though
this reduces the holographic resolution capacity, because at this
spatial frequency the resulting space charge field for the case of an
optimized velocity moving grating is at a maximum for a 10 kV/cm bias
field, as shown in Fig. 6. The second combination of parameters gives
almost three times the coupling strength of the first configuration, for
the case of Doppler-enhanced recording. The space charge field factors
resulting from these two combinations of parameters, assuming the
material parameters given in Table I and the analytic solutions
published by Refregier et al. [10], are shown in Table II. The phase
angles quoted in Table II give the phase of the space charge field
relative to the optical interference pattern in each case. In addition,
an incident intensity ratio (signal to pump beam) of 10^[exponent
illegible] has been assumed throughout to assure applicability of the
undepleted pump approximation. It can be expected that similar effects
will occur for higher incident intensity ratios, but modification of the
photorefractive charge transport model to incorporate the effects of a
depleted pump beam may then be necessary.

Consider first the inhomogeneities induced in the modulation depth m by
self-diffraction, which have been calculated for the 6 kV/cm bias field,
300 cycles/mm grating spatial frequency recording combination, with
results as shown in Figs. 4 and 5. Fig. 4 shows the photorefractive
grating modulation depths as a function of grating thickness calculated
for both orientations, as well as for both running and stationary
gratings. For the case of stationary gratings, self-diffraction yields
significant oscillatory behavior in the {K_G ∥ <001>} orientation, but
only minor perturbations in the {K_G ⊥ <001>} orientation. On the other
hand, in the case of running gratings, both orientations yield striking
modulation effects. In particular, the {K_G ⊥ <001>} configuration
exhibits a periodic layered structure in the modulation depth, with
regions at particular grating thicknesses that approach zero modulation.
The photorefractive grating phase angle is plotted for the same set of
cases in Fig. 5. For both orientations, the phase is fixed throughout in
the case of running gratings at the optimum velocity. For stationary
gratings, the {K_G ⊥ <001>} orientation exhibits an oscillatory phase
behavior throughout the depth, while the {K_G ∥ <001>} orientation
exhibits a nonlinear warping of the phase fronts with an additional
component of periodic behavior. Note then that the {K_G ⊥ <001>}
orientation in the case of stationary gratings has essentially constant
modulation but oscillatory phase, while the same orientation in the case
of running gratings has constant phase but oscillatory modulation.

TABLE II
Space Charge Field Factors

Bias field of 6 kV/cm and grating spatial frequency of 300 cycles/mm
    Stationary gratings:    [magnitude and phase illegible]
    Doppler-enhanced:       [magnitude and phase illegible]

Bias field of 10 kV/cm and grating spatial frequency of 70 cycles/mm
    Stationary gratings:    [magnitude and phase illegible]
    Doppler-enhanced:       [magnitude and phase illegible]

Similar profiles have been calculated for the 10 kV/cm, 70 cycles/mm
combination, with results almost exactly like those shown in Figs. 4 and
5. The phase modulation is almost identical to that shown in Fig. 5,
with only a very slight increase for the larger coupling case. The
amplitude growth has the same general form as shown in Fig. 4, but spans
a larger range of values.

Next consider the polarization state evolution results shown in Figs.
7-14. The advance of the polarization angle for the {K_G ∥ <001>}
orientation, shown in Figs. 7 and 9, predominantly follows a linear
trend defined by the magnitude of the optical activity, with a spatially
periodic oscillation about this linear progression. The periodic
oscillation is induced by coupling of the signal beam to the pump beam,
which undergoes its own spatially periodic evolution in polarization
state. In the absence of an applied bias field, this spatial period is
defined by d = 180°/ρ, in which ρ is the optical rotatory power. In the
presence of a bias-field-induced linear birefringence C, the spatial
period is somewhat smaller, and is defined by
d = 180°/(ρ^2 + C^2)^{1/2}. The periodic perturbation is strongest for
Doppler-enhanced self-diffraction, and weakest for the uniform space
charge grating model. The evolution of the ellipticity for the
{K_G ∥ <001>} orientation, shown in Figs. 8 and 10, also exhibits the
same periodicity. (Ellipticity is defined as the ratio of the minor to
the major axes of the polarization ellipse, with opposite signs for
right-handed and left-handed electric vector rotation.) The ellipticity
for the Doppler-enhanced self-diffraction model of the signal beam
deviates most strongly from that of the pump beam.
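
A short Python calculation illustrates the two spatial period expressions quoted in this paragraph. The rotatory power of 38.6°/mm is the value used elsewhere in this article; the linear birefringence value is purely hypothetical, and the sketch assumes that the birefringence term is expressed in the same angular units as ρ and adds in quadrature, which is consistent with the statement that the period becomes smaller under bias.

```python
import math


def polarization_period_mm(rho_deg_per_mm, biref_deg_per_mm=0.0):
    """Spatial period d = 180 / sqrt(rho^2 + C^2), in millimeters."""
    return 180.0 / math.hypot(rho_deg_per_mm, biref_deg_per_mm)


d_no_bias = polarization_period_mm(38.6)           # about 4.66 mm
d_with_bias = polarization_period_mm(38.6, 20.0)   # hypothetical C = 20 deg/mm
```

With no bias field the period is about 4.66 mm, so a crystal a few millimeters thick contains less than one full period of the pump polarization evolution; any nonzero birefringence term shortens the period.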

The {K_G ⊥ <001>} orientation exhibits even more striking modulation of
the signal beam polarization state by self-diffraction effects, at least
for Doppler-enhanced coupling, as shown in Figs. 11-14. Note the strong
variations in the polarization angle for the Doppler-enhanced
self-diffracted signal, as shown in Figs. 11 and 13. The stationary
grating model with self-diffraction exhibits a polarization response
which is much closer to that shown by the uniform grating model. The
ellipticity evolution for the Doppler-enhanced signal beam also differs
remarkably from that of the stationary grating and uniform grating
models, as seen in Figs. 12 and 14.

Fig. 7. Evolution of the polarization angle (major axis orientation) for
diffraction in the {K_G ∥ <001>} orientation for the undepleted pump
beam (solid line), signal beam with Doppler-enhanced self-diffraction
([...] dashed line), signal beam with a stationary grating and
self-diffraction (single dashed line), and signal beam without
self-diffraction assuming a phase grating uniform in magnitude and phase
throughout the holographic volume (double dashed line). The pump and
signal beams are assumed to enter the crystal with identical linear
polarization states, with polarization angle parallel to the grating
wave vector and the applied bias field.

Fig. 8. Evolution of the ellipticity for diffraction in the
{K_G ∥ <001>} orientation for the same recording conditions considered
in Fig. 7.


Figs. 15 and 16 show the degradation in energy-coupling gain with
increasing amounts of optical rotation ρd in the diffusion and drift
regimes, respectively, for the two principal crystallographic
orientations. The results shown in Fig. 15 are derived from analytic
solutions presented by Foote and Hall [30], while the results shown in
Fig. 16 are numerically derived from the coupled wave equations. In both
figures, the pump and signal beams are assumed to enter the
photorefractive medium with identical linear polarization states. In
each case, the incident polarization angle is varied from 0 to 180° to
find the minimum and the maximum coupling gain; the optimization of the
polarization angle is repeated for each value of ρd. These two figures
were derived by keeping the optical rotatory power ρ fixed at 38.6°/mm
and varying the crystal thickness d, but very similar curves have been
obtained by keeping the thickness d constant and varying the rotatory
power ρ. These curves have important implications for holographic
experiments which measure the effective electrooptic coefficient. These
measurements are difficult to interpret correctly for a number of
reasons, one being the intricate polarization properties of volume
gratings in sillenite crystals. Figs. 15 and 16 indicate that a 50
percent reduction in coupling gain can easily be accounted for by
degradation induced by optical activity.

Fig. 9. Evolution of the polarization angle for diffraction in the
{K_G ∥ <001>} orientation, as shown in Fig. 7, but for a higher bias
field of 10 kV/cm and a lower grating spatial frequency of 70 cycles/mm,
which combine to give almost a threefold increase in the coupling
strength.

Fig. 10. Evolution of the ellipticity for diffraction in the
{K_G ∥ <001>} orientation for the same recording conditions considered
in Fig. 9.

Fig. 11. Evolution of the polarization angle for recording conditions
considered in Fig. 7, but for the orthogonal {K_G ⊥ <001>} recording
orientation.

Fig. 13. Evolution of the polarization angle for recording conditions
considered in Fig. 9, but for the {K_G ⊥ <001>} orientation.

Fig. 14. Evolution of the ellipticity for recording conditions
considered in Fig. 13.

Fig. 15. Degradation of energy coupling gain Γ (as defined in [...])
with increasing levels of optical rotation ρd, in which ρ is the optical
rotatory power and d is the crystal thickness, for the two principal
recording orientations. These curves are based upon analytic solutions
derived by Foote and Hall for diffusion regime recording [30].

Fig. 16. Degradation of energy coupling with increasing levels of
optical rotation ρd for Doppler-enhanced drift-regime recording.
Recording conditions are identical to those assumed in Figs. 7 and 11.

We have also explored the impact on energy coupling gain of using
nonidentically polarized incident pump and signal beams, and of using
incident beams with other than linearly polarized states. A modest
increase in the gain of [...]

Much of the interesting polarization evolution behavior exhibited in
Figs. 7-14 derives from the anisotropies inherent in the light
diffraction process, independent of optical activity. When the optical
activity terms are set equal to zero, the sets of coupled wave equations
with self-diffraction effects included can be completely decoupled into
orthogonal polarization eigenmodes, and analytic solutions can be
obtained for each eigenmode. The eigenaxes are different for the two
principal recording configurations, and are shown by the (X_EO, Y_EO,
Z_EO) axes in Figs. 1 and 2.

For example, the {K_G ∥ <001>} orientation shown in Fig. 1 has eigenaxes
X_EO and Y_EO parallel and perpendicular to the grating wave vector and
the applied electric field, respectively. No energy coupling is
exhibited for light polarized along the X_EO eigenaxis. Instead, maximum
coupling occurs for light polarized along the orthogonal axis, Y_EO in
Fig. 1. This coupling can be either positive or negative, depending upon
the direction of the applied electric field. We assume throughout this
paper that the field direction is oriented for positive coupling.

The {K_G ⊥ <001>} orientation has its two eigenaxes X_EO and Y_EO
oriented at 45° with respect to the grating wave vector and the applied
bias field, as shown in Fig. 2. Positive energy coupling gain is
exhibited along one of these eigenaxes, and negative energy coupling
gain is exhibited along the orthogonal axis, although the identification
of which axis exhibits positive gain can be altered by reversing the
direction of the applied bias field. This can be understood with
reference to the distortion of the index ellipsoid imposed by a uniform
bias electric field. In the absence of any electric field, the index is
isotropic, so that the constant index surface is a sphere. A uniform
bias electric field distorts this index surface into an ellipsoid with
principal axes aligned along the (X_EO, Y_EO, Z_EO) eigenaxes shown in
Fig. 2. The index of refraction is increased by a small amount Δn along
one of the axes, say the X_EO axis, while it is decreased along the Y_EO
axis. Now consider a sinusoidal space charge field, which induces a
corresponding sinusoidal phase grating for light polarized along the
X_EO eigenaxis, and a similar phase grating for light polarized along
the Y_EO eigenaxis. Note, however, that the grating induced along the
Y_EO eigenaxis is 180° out of phase with respect to the grating induced
along the X_EO eigenaxis, because the index shift along the one axis is
equal and opposite to the index shift along the orthogonal axis.
Therefore, if one of the two gratings is optimally phased for positive
energy coupling, then the [...]

[...] exhibits birefringent diffraction properties, also shown in the
[...] light [29]. This birefringent mode is exhibited only in the
{K_G ⊥ <001>} orientation, not in the [...] polarization properties in
the presence of self-diffraction, as explained next.

C. Comparison of the Two Principal Orientations

A critical insight into the light diffraction process in the presence of
self-diffraction is that the space charge grating can only be recorded
where the pump and signal beams share the same polarization state, not
when they have orthogonal states. Therefore, an interesting aspect to
monitor in the numerical solutions is the comparative content of signal
light intensity with polarization state parallel to and orthogonal to
that of the local pump beam. Note the use of the term "local": the
polarization state of the pump beam continuously evolves as it
propagates through the crystal, so that the signal beam polarization
decomposition of interest similarly evolves.
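
The decomposition described here can be sketched as a projection of the signal Jones vector onto the local pump polarization. The Python fragment below is an illustrative sketch only; the function name `split_signal`, the tuple representation of the Jones vectors, and the sample states are assumptions and are not taken from the original numerical code.

```python
def split_signal(signal, pump):
    """Return (parallel, orthogonal) intensities of `signal` relative to `pump`.

    Both arguments are two-component complex Jones vectors (tuples).
    """
    pump_norm2 = sum(abs(p) ** 2 for p in pump)
    # Projection coefficient <pump|signal> / <pump|pump>.
    c = sum(p.conjugate() * s for p, s in zip(pump, signal)) / pump_norm2
    parallel = tuple(c * p for p in pump)
    orthogonal = tuple(s - q for s, q in zip(signal, parallel))

    def intensity(v):
        return sum(abs(x) ** 2 for x in v)

    return intensity(parallel), intensity(orthogonal)


# Hypothetical local states: pump linearly polarized along x,
# signal elliptically polarized.
i_par, i_orth = split_signal((0.3 + 0j, 0.4j), (1 + 0j, 0j))
```

Only the parallel part can interfere with the pump to write the grating. Because the pump state evolves with depth, the projection has to be re-evaluated at every z rather than once at the entrance face.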

Representative  numerical  solutions  for  these  two  polar¬ 
ization  components  are  shown  in  Fig.  1“  for  the  ;  X 
<001  >}  orientation  and  in  Fig.  IS  for  the  { A’.;  _ 
<001  >  }  orientation.  The  two  polarization  components  m 
these  figures  are  labeled  “coherent"  and  "incoherent.  ' 
referring  to  the  components  with  polarization  states  iden¬ 
tical  to  and  orthogonal  to  the  local  pump  beam,  respec¬ 
tively.  Note  that  the  coherent  component  dominates 
throughout  the  propagation  path  for  the  { K<j  |  <  10  i  •  ■ 
orientation,  as  shown  in  Fig.  17.  Recall  that  this  orien¬ 
tation  exhibited  no  birefnngent  diffraction  modes  in  the 
absence  of  optical  activity.  Addition  of  optical  activity 
introduces  only  a  small  amount  of  birefringent  diffraction 
Compare  this  with  the  response  for  the  {  Kq  x  <  00 1  >  | 
orientation  shown  in  Fig.  18.  Assuming  that  the  incident 
beams  are  polarized  to  maximize  the  energy  coupling 
(which  implies  different  incident  polarizations  tor  tne  two 
configurations),  the  growth  of  the  coherent  component  is 
identical  for  both  recording  orientations,  at  least  for  the 
first  half  millimeter  or  so  of  propagation.  For  the  {  Kc  4 
<001  )  }  orientation,  the  optical  activity  rotates  the  pump 
polarization  angle  so  that  after  the  first  millimeter  or  so 
of  propagation  significant  birefnngent  diffraction  is  en¬ 
countered.  At  this  point,  further  growth  of  the  coherent 
signal  light  is  halted,  while  the  incoherent  component 
continues  to  grow.  One  consequence  is  that  further  growth 
of  the  space  charge  field  which  supports  the  diffraction  is 
similarly  halted.  After  more  propagation,  say  at  about  the 
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Fig.  17.  Growth  of  :he  inconerem  ana  aonerem  signal  beam  components 
for  seif-diffraction  :n  the  ( A'5  1  <  001  ,  J  onentation.  The  coherent  com¬ 
ponent  :s  denned  as  that  wmch  has  a  polarization  slate  identical  to  the 
local  pump  beam,  and  hence  can  interfere  conerently  with  the  pump  beam 
to  form  a  fringe  pattern  and  phoiorefractiveiy  induce  a  space  charge  field. 
The  inconerem  component  nas  an  orthogonal  polarization  state,  and  hence 
can  only  erase  the  space  charge  Seid  Recording  conditions  are  identical 
to  those  assumed  in  Figs.  7  and  3.  except  that  the  incident  polarization 
angle  is  90“  with  respect  to  tne  grating  wave  vector  to  maximize  the 
energy  coupling. 


Fig.  18.  Growth  of  the  incoherent  and  coherent  signal  beam  components 
for  self-dilfraction  in  the  {  x  <  001  )  }  orientation.  Recording  con¬ 
ditions  are  identical  to  those  assumed  in  Figs.  11  and  12.  except  that  the 
incident  polarization  angle  is  -43*  with  respect  to  the  grating  wave  vec¬ 
tor  to  maximize  the  energy  coupling. 

2  mm  depth,  the  optical  activity  has  rotated  the  pump 
beam  polarization  enough  so  that  now  significant  negative 
energy  coupling  is  experienced,  and  the  coherent  signal 
component  loses  intensity  rather  than  gains  in  this  region. 
No  comparable  birefringent  modes  or  negative  gain  axes 
exist for the {K_G ∥ ⟨001⟩} orientation. This explains
why the {K_G ∥ ⟨001⟩} orientation exhibits superior
energy-coupling gain compared with the {K_G ⊥ ⟨001⟩}
orientation, as has been reported in the literature [31].

IV.  Conclusions 

A method for calculating the polarization state evolution
in photorefractive sillenite crystals in the presence of
self-diffraction effects has been described, and representative
solutions have been studied. Self-diffraction effects
are shown to alter the diffraction process remarkably. For
the {K_G ∥ ⟨001⟩} orientation, self-diffraction induces
significant energy coupling gain; for the {K_G ⊥ ⟨001⟩}
orientation, self-diffraction significantly modifies the
polarization states, especially for Doppler-enhanced recording.
Striking spatial inhomogeneities are induced in the
space charge field, especially for the {K_G ⊥ ⟨001⟩}
orientation, leading to striations in the space charge field
along the direction of propagation for the case of
Doppler-enhanced recording, and essentially constant amplitude
but corrugated phase fronts for stationary recording. It is
interesting to note that the complexity and time to compute
for the numerical model are only modestly increased
with the addition of self-diffraction, at least for the linear
photorefractive recording model in steady state used
herein. The diffraction properties of the two principal recording
configurations prove to be distinctly different, and
in part this difference is attributed to fundamental anisotropies
inherent in the diffraction process. Finally, the
degradation in energy coupling gain with increasing
amounts of optical rotation ρd has been calculated for a
few representative cases. This degradation can amount to
of order 50 percent and must be considered in any
holographically based measurements of effective electrooptic
coefficients.
A computational algorithm for analyzing diffraction properties of optical devices, the optical beam propagation
method,  has  suggested  a  new  class  of  devices  by  which  Bragg  regime  (thick  grating)  response  can  be  obtained  from  a 
spaced  sequence  of  thin  grating  layers.  Such  stratified  volume  holographic  optical  elements  (SVHOE’s)  can 
emulate  distributed  volume  gratings  in  terms  of  diffraction  efficiency  and  angular  selectivity  and  in  addition  possess 
periodic  diffraction  properties  that  might  serve,  for  example,  as  interconnections  for  optical  cellular  logic  arrays. 
SVHOE’s  also  offer  a  unique  capability  for  altering  the  device  diffraction  response  on  a  layer-by-layer  basis, 
allowing  for  control  of  both  the  diffraction  peak  width  and  the  angular  separation  of  adjacent  peaks. 


Holographic  optical  elements  are  of  fundamental  im¬ 
portance  to  a  number  of  applications  in  optical  infor¬ 
mation  processing  and  computing,  including  optical 
interconnections,  content-addressable  memories,  and 
various  linear  and  nonlinear  signal  processing  configu¬ 
rations.  Numerical  analyses  of  typical  volume  holo¬ 
graphic  structures  are  critical  for  characterizing  and 
optimizing  such  optical  elements.  One  of  the  most 
flexible  and  broadly  applicable  numerical  tools  for 
studying  the  diffraction  behavior  of  volume  holograms 
is the optical beam propagation method (BPM).1,2 In
this method, the distributed optical inhomogeneities
that  characterize  a  typical  hologram  are  approximated 
by  a  discrete  sequence  of  physically  and  mathemati¬ 
cally  simplified  elements:  infinitesimally  thin  phase 
and/or  polarization  modulation  layers,  interleaved 
with  optically  homogeneous  layers  of  finite  thickness. 
By  approximating  the  distributed  grating  with  a  suffi¬ 
ciently  large  number  of  these  discrete  elements,  the 
resulting  numerical  model  of  the  grating  can  be 
brought  arbitrarily  close  in  its  response  to  that  of  the 
desired  distributed  bulk  grating. 
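
The decomposition described above can be sketched numerically. The following is a minimal one-dimensional illustration, not the implementation used in this work: thin sinusoidal phase screens alternate with homogeneous layers that are handled by angular-spectrum propagation. The function names, the normal-incidence restriction, and the default parameter values (grating period, wavelength, refractive index, grid size) are all assumptions chosen for illustration.

```python
import numpy as np


def thin_phase_screen(field, x, phase_amp, period):
    """Multiply the field by an infinitesimally thin sinusoidal phase grating."""
    return field * np.exp(1j * phase_amp * np.cos(2.0 * np.pi * x / period))


def propagate(field, dx, wavelength, n, distance):
    """Carry the field across a homogeneous layer (angular-spectrum method)."""
    kx = 2.0 * np.pi * np.fft.fftfreq(field.size, d=dx)
    k = 2.0 * np.pi * n / wavelength
    # Complex square root: evanescent components decay instead of growing.
    kz = np.sqrt((k**2 - kx**2).astype(complex))
    return np.fft.ifft(np.fft.fft(field) * np.exp(1j * kz * distance))


def bpm_first_order(num_layers, total_phase, buffer_thickness,
                    period=13.5e-6, wavelength=633e-9, n=1.5,
                    periods=8, samples_per_period=32):
    """Fraction of power in the +1 order after a stack of thin phase screens.

    The total phase modulation is divided equally among the screens, and
    adjacent screens are separated by homogeneous buffers of equal thickness.
    Illumination is a normally incident plane wave (an illustrative choice).
    """
    npts = periods * samples_per_period
    dx = period / samples_per_period
    x = np.arange(npts) * dx
    field = np.ones(npts, dtype=complex)
    for layer in range(num_layers):
        field = thin_phase_screen(field, x, total_phase / num_layers, period)
        if layer < num_layers - 1:
            field = propagate(field, dx, wavelength, n, buffer_thickness)
    spectrum = np.fft.fft(field) / npts
    # The window holds `periods` grating periods, so order +1 is bin `periods`.
    return abs(spectrum[periods]) ** 2


if __name__ == "__main__":
    for layers in (1, 2, 3):
        print(layers, bpm_first_order(layers, 1.0, 200e-6))
```

With a single screen the sketch reduces to the familiar thin-grating result, in which the first-order efficiency is the square of the first-order Bessel function of the phase amplitude; with zero buffer thickness the screens simply add their phases.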

The  BPM  concept  of  separating  the  modulation 
process  from  the  diffraction  process  in  turn  suggests  a 
new  class  of  optical  devices:  the  stratified  volume 
holographic  optical  element,  or  SVHOE,3  as  shown  in 
Fig.  1.  The  SVHOE  device  structure  consists  of  a 
sequence  of  thin  photosensitive  holographic  recording 
layers  that  perform  the  optical  modulation  function, 
interleaved  with  optically  passive  buffer  layers,  i.e., 
layers  that  impress  no  modulation  on  the  light  beam 
but  rather  allow  the  diffraction  processes  necessary  for 
thick  grating  response  to  occur.  A  grating  recorded  in 
any  individual  modulation  layer  would  necessarily  ex¬ 
hibit  Raman-Nath  characteristics  because  each  re¬ 
cording  layer  is  quite  thin.  However,  a  surprisingly 
small  number  of  recording  layers,  each  spaced  from  its 
neighbors  by  a  passive  layer  of  appropriate  thickness, 
can  exhibit  pronounced  Bragg  regime  (thick  grating) 
response.  In  addition,  SVHOE  structures  exhibit 
striking  diffraction  characteristics  not  observed  in 
bulk  gratings,  as  demonstrated  below. 

The  SVHOE  structure  is  of  particular  interest  for 
materials  that  can  be  produced  in  only  finite  thickness 
layers but that nonetheless exhibit either strong modulation
effects or fast response times. Examples of
such strongly modulating but limited-thickness media
include III-V and II-VI compound semiconductor
multiple quantum wells and superlattices, and ordered
noncentrosymmetric polymer layers characterized
by significant electro-optic coefficients.

For  simplicity,  the  following  discussion  emphasizes 
the  angular  alignment  sensitivity  of  the  grating  struc¬ 
ture;  alternative  performance  measures,  such  as  sensi¬ 
tivity  to  the  wavelength  of  the  readout  light,  corre¬ 
spond  closely  to  this  angular  alignment  sensitivity 
metric.  To  measure  the  angular  alignment  response, 
a  grating  structure  is  illuminated  with  a  well-colli¬ 
mated  single-wavelength  laser  beam.  That  fraction  of 
the  incident  light  intensity  diffracted  into  an  adjacent 
diffraction  order,  such  as  the  +1  order,  is  measured 
with  a  photodetector  as  a  function  of  the  tilt  angle  of 
the  grating. 

The  angular  response  for  a  representative  distribut¬ 
ed  bulk  grating  is  shown  in  the  top  half  of  Fig.  2  to 
serve  as  a  reference  for  comparison  with  sample  re¬ 
sponse  curves  for  typical  SVHOE  structures.  Such  a 
uniform  volume  (or  distributed  bulk)  grating  has  an 
alignment sensitivity curve characterized by essentially
a sinc² profile. The angular width of this response


Fig. 1. The SVHOE structure, showing the recording of a
grating with wave vector Kg by means of two coherent,
collimated recording beams.

curve is inversely proportional to the grating thickness:
a thin grating has a broad response curve indicative
of Raman-Nath regime behavior, whereas a
thick grating has a quite narrow response curve indicative
of Bragg regime behavior.4

Comparative  angular  response  curves  for  several 
SVHOE  structures  are  shown  in  Figs.  2  and  3.  In 
these  figures,  the  recording  layers  in  each  thin  grating 
stack  are  assumed  for  simplicity  to  have  infinitesimal 
thickness  and  to  impose  only  phase  (as  opposed  to 
amplitude)  modulation.  When  the  structure  com¬ 
prises  several  thin  phase  modulation  layers,  the  sum  of 
the  phase  modulation  for  all  layers  has  been  set  equal 
to  that  of  the  reference  case  of  the  uniform  volume 
grating  (Fig.  2,  top)  and  is  divided  equally  among  all 
the  thin  recording  layers.  The  thickness  of  each  opti¬ 
cally  passive  buffer  layer  is  adjusted  such  that  the 
total  device  thickness  is  equal  to  that  of  the  thick 
uniform  volume  grating.  Finally,  the  index  of  refrac¬ 
tion  of  the  buffer  layers  has  been  set  equal  to  the 
average  index  of  the  modulation  layers. 


Fig.  2.  Angular  response  for  a  distributed  thick  grating 
(top),  for  a  single  thin  grating  (horizontal  line,  bottom),  and 
for  a  pair  of  thin  gratings  (bottom). 


Consider  first  a  single  thin  grating  with  modulation 
strength  equal  to  that  of  the  bulk  grating.  Such  a  thin 
grating  has  negligible  angular  alignment  sensitivity,  as 
shown by the horizontal line in the bottom half of Fig.
2. The angular alignment sensitivity of a two-grating
structure, however, exhibits essentially a sinusoidal
oscillation, as shown in the bottom half of Fig. 2, with
an angular period defined by the ratio of the grating
period to the thickness separating the two gratings.
(Note that this structure has potentially interesting
applications as a high-resolution angle encoder.)

A  stack  of  five  thin  gratings,  as  shown  in  Fig.  3, 
suppresses  three  of  every  four  peaks  that  appear  when 
only  the  two  outer  gratings  are  present.  In  general,  a 
stack of N thin gratings exhibits an angular alignment
sensitivity that is periodic, with a period of N - 1
two-grating peaks, in which N - 2 of the two-grating peaks
have been suppressed by the interior gratings.
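
The peak-suppression rule can be illustrated with a simple array-factor sketch. It assumes the weak-diffraction (single-scattering) limit, infinitesimally thin layers of equal strength, and equal spacing; `phase_slip` is a hypothetical normalized variable, proportional to the tilt angle, giving the relative phase accumulated between adjacent layers. It is an idealization for illustration rather than the beam propagation calculation used to generate the figures.

```python
import numpy as np


def stack_response(num_gratings, phase_slip):
    """Normalized first-order response of equally spaced thin gratings.

    In the weak-diffraction limit the contributions of the layers add
    coherently, each delayed by `phase_slip` relative to its neighbor.
    """
    j = np.arange(num_gratings)
    phase_slip = np.asarray(phase_slip, dtype=float)
    amplitude = np.exp(1j * np.multiply.outer(phase_slip, j)).sum(axis=-1)
    return np.abs(amplitude / num_gratings) ** 2


if __name__ == "__main__":
    n = 5
    # Peaks of the two outer gratings alone fall at multiples of 2*pi/(n - 1).
    two_grating_peaks = 2.0 * np.pi * np.arange(n) / (n - 1)
    print(stack_response(n, two_grating_peaks))
```

For five gratings, the sketch retains full response at every fourth two-grating peak position and strongly reduces the three in between, in line with the N - 2 rule stated above.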

For  a  given  angular  interval,  the  incorporation  of  a 
sufficiently  large  number  of  thin  gratings  leads  to  a 
structure  with  an  angular  alignment  sensitivity  that  is 
indistinguishable  from  that  of  the  bulk  grating.  Thus 
a  SVHOE  structure  can  emulate  closely  the  Bragg 
response  of  a  bulk  grating,  using  surprisingly  few  thin 
grating  layers.  In  addition,  this  structure  exhibits 
features  not  found  in  bulk  gratings  that  may  prove 
useful  for  certain  system  applications.  For  example, 
illumination  of  a  SVHOE  structure  with  focused 
monochromatic  light  produces  uniformly  spaced  an¬ 
gular  response  peaks,  which  can  be  transformed  with  a 
lens  to  produce  an  array  of  equally  spaced  points. 
This  function  is  useful  for  interconnection  of  optical 
cellular  logic  arrays.  For  example,  illumination  with 
850-nm  light  of  a  two-grating  SVHOE  with  a  grating 
separation  of  1  cm  and  a  grating  spatial  frequency  of 
350 cycles/mm at F/3.3 will produce in excess of 1000
regularly  spaced  diffracted  beams. 
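
The beam count quoted above can be checked with a short estimate. The sketch assumes that the angular period of the response equals the ratio of the grating period to the grating separation (as stated earlier for the two-grating structure), that the F-number defines the full cone angle of the focused illumination, and that refraction at the device surfaces can be ignored. Under these assumptions the wavelength does not enter. The function names are illustrative.

```python
import math


def angular_period(grating_period, separation):
    """Angular spacing of response peaks (radians), small-angle estimate."""
    return grating_period / separation


def cone_full_angle(f_number):
    """Full cone angle (radians) of a focused beam with the given F-number."""
    return 2.0 * math.atan(1.0 / (2.0 * f_number))


def beam_count(spatial_freq_per_mm, separation_mm, f_number):
    """Approximate number of response peaks that fit inside the cone."""
    period_mm = 1.0 / spatial_freq_per_mm
    return cone_full_angle(f_number) / angular_period(period_mm, separation_mm)


if __name__ == "__main__":
    # 350 cycles/mm, 1 cm separation, F/3.3 illumination
    print(round(beam_count(350.0, 10.0, 3.3)))
```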

Further,  for  real-time  material  implementations  of 
SVHOE  structures  in  which  the  photoresponse  for 
grating formation can be either optically or electrically
switched on a layer-by-layer basis, a structure with a
given  number  of  layers  can  interconnect  either  every 
element  or  every  pth  element  with  the  source,  in  a 
programmable  manner.  For  example,  such  layer-by- 
layer  programmability  may  prove  feasible  by  either 
optical bleaching5 or electrical tuning6 of the exciton
resonance in a multiple quantum well structure or by
modulation of a doping superlattice.7 SVHOE structures
can also be conceived that exhibit entirely different
diffraction behavior in response to grating formation
by distinct writing wavelengths, by actively altering
the layer photosensitivity spectrum on a layer-by-layer
basis. These features collectively allow for
additional  degrees  of  freedom  in  the  formulation  of 
both  passive  and  active  holographic  elements. 

Several  simple  experiments  have  been  performed  to 
demonstrate  and  verify  the  SVHOE  concept.  Phase 
gratings  have  been  recorded  in  positive  photoresist 
(Shipley  1450J)  deposited  upon  microscope  cover 
glass pieces (22 mm × 22 mm, approximately 200 µm
thick). The grating frequency was adjusted to be 74
cycles/mm, as recorded by a 442-nm laser interferometer.
Readout was performed by a 633-nm laser beam,
which  probed  either  individual  gratings  or  stacks  of 
Fig. 4. Angular response for three gratings, each spaced
from  the  next  by  a  single  glass  plate  thickness:  experiment 
(solid  curve)  and  computer  model  (dashed  curve). 


gratings.  Gratings  were  exposed  individually,  pro¬ 
cessed,  and  then  assembled  using  moire  imaging  tech¬ 
niques  to  achieve  proper  alignment  both  in  angle  and 
in  linear  translation  with  respect  to  each  other. 

The  angular  response  has  been  measured  and  com¬ 
pared  with  theoretical  calculations  for  the  following 
cases: (1) a single grating, (2) two gratings separated
by 200 µm (the thickness of a single glass plate), (3)
two gratings separated by 400 µm, (4) two gratings
separated by 800 µm, and (5) three gratings, each separated
from the next by 200 µm, for a total device
thickness of 400 µm. In all cases, the agreement between
experiment and theory is excellent. Representative
experimental and theoretical curves for the
three-grating  case  are  shown  in  Fig.  4. 

The  peak  diffraction  efficiency  of  a  SVHOE  struc¬ 
ture  is  a  function  not  only  of  the  modulation  strength 
of  each  grating  layer  but  also  of  the  spacing  between 
the  layers.  A  single  thin  phase  grating  can  diffract  no 
more  than  33.9%  of  the  incident  readout  light  power 
into the +1st diffraction order, assuming a sinusoidal
grating profile. A two-grating structure can diffract
as much as 67.7%, as discussed by Zel'dovich et al.,8 or
as  little  as  22.2%,  assuming  the  same  grating  strength 
for  each  layer  but  changing  only  the  buffer  layer  thick¬ 
ness.  A  three-grating  structure  can  diffract  as  much 
as  87.0%,  a  four-grating  structure  in  excess  of  90%,  and 
a  five-grating  structure  more  than  95%,  if  the  buffer- 
layer  thickness  is  chosen  to  be  optimum  in  each  case. 
Maximum diffraction efficiency (at least for grating
strengths  ranging  from  zero  to  the  first  maximum  in 
diffraction  efficiency)  occurs  when  the  thickness  of 
each  buffer  layer  is  given  by 

$$Q_{\mathrm{buffer}} \equiv \frac{2\pi\lambda D_{\mathrm{buffer}}}{n\Lambda^{2}} = 2\pi\left(m + \tfrac{1}{2}\right), \tag{1}$$

in which λ is the light wavelength, D_buffer is the buffer-layer
thickness, n is the index of refraction of the
buffer layers, Λ is the period of the grating, and m is an
integer. Thus, neglecting the thickness of individual
grating layers and counting only the buffer layers, the
optimum total thickness D_total for a structure with N
gratings is given by

$$Q_{\mathrm{total}} \equiv \frac{2\pi\lambda D_{\mathrm{total}}}{n\Lambda^{2}} = 2\pi\left(m + \tfrac{1}{2}\right)(N - 1). \tag{2}$$

Note that the broadest spatial-frequency response occurs
when the smallest optimal thickness is employed,
i.e., when Q_total = (N - 1)π.
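
As a worked illustration of the optimum-thickness condition, the sketch below solves Q = 2π(m + 1/2) for the buffer thickness, taking Q = 2πλD/(nΛ²) with D the layer thickness. The function names are illustrative, and the refractive index of 1.5 used in the example is an assumed typical value for cover glass; it is not quoted in the text.

```python
import math


def q_parameter(wavelength, thickness, n, period):
    """Q = 2*pi*lambda*D / (n * Lambda**2) for a layer of thickness D."""
    return 2.0 * math.pi * wavelength * thickness / (n * period**2)


def optimum_buffer_thickness(wavelength, n, period, m=0):
    """Buffer thickness satisfying Q_buffer = 2*pi*(m + 1/2)."""
    return (m + 0.5) * n * period**2 / wavelength


def optimum_total_q(num_gratings, m=0):
    """Q_total = 2*pi*(m + 1/2)*(N - 1) for N gratings."""
    return 2.0 * math.pi * (m + 0.5) * (num_gratings - 1)


if __name__ == "__main__":
    wavelength = 633e-9      # readout wavelength (m)
    period = 1e-3 / 74.0     # 74 cycles/mm grating (m)
    n_glass = 1.5            # assumed index of the glass buffer
    d_opt = optimum_buffer_thickness(wavelength, n_glass, period)
    print(f"optimum buffer thickness (m = 0): {d_opt * 1e6:.0f} um")
```

Under the assumed index, the smallest optimum thickness for the experimental grating frequency and readout wavelength comes out a little above 200 µm, close to the thickness of a single cover glass plate.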

The  angular  response  analysis  shown  in  Figs.  2  and  3 
has  assumed  infinitesimally  thin  recording  layers. 
The  effect  of  finite  recording-layer  thicknesses  is  to 
introduce  an  angular  response  rolloff  associated  with 
any  single  layer,  which  in  turn  serves  as  an  envelope 
function  to  modulate  the  angular  response  of  more 
intricate  structures  constructed  from  multiple  layers. 
In  fact,  the  angular  response  characteristic  is  just  the 
Fourier  transform  of  the  depth  distribution  of  the 
grating  strength.9  Thus  for  certain  applications  one 
design constraint might be an upper bound on the
thickness  of  each  recording  layer  to  keep  the  overall 
response  rolloff  envelope  sufficiently  broad. 

We  have  demonstrated  that  SVHOE’s  can  satisfac¬ 
torily  emulate  distributed  bulk  gratings  over  a  pre¬ 
scribed  angular  interval.  However,  even  more  in¬ 
triguing  are  those  attributes  unique  to  SVHOE’s  that 
suggest  new  device  applications.  Specifically,  three 
features have been identified to date: (1) the ability
to  control  the  recording  sensitivity  of  individual  re¬ 
cording  layers,  independent  of  adjacent  layers,  either 
optically  or  electrically,  thus  altering  the  diffraction 
characteristics  of  the  device;  (2)  the  ability  to  record 
holograms  with  distinct  diffraction  characteristics  at 
two  or  more  wavelengths,  also  through  the  use  of  elec¬ 
trical  or  optical  control  on  a  layer-by-layer  basis,  and 
(3)  the  existence  of  periodic  multiple  response  peaks 
in  the  angular  response  profiles. 
Introduction

Over  the  past  four  decades,  the 
growth  of  information  processing  and 
computational  capacity  has  been  truly 
remarkable,  paced  to  a  large  extent  by 
equally  remarkable  progress  in  the  inte¬ 
gration  and  ultra-miniaturization  of 
semiconductor  devices.  And  yet  it  is 
becoming  increasingly  apparent  that 
currently  envisioned  electronic  proces¬ 
sors  and  computers  are  rapidly 
approaching  technological  barriers  that 
delimit  processing  speed,  computa¬ 
tional  sophistication,  and  throughput 
per  unit  dissipated  power.  This  realiza¬ 
tion  has  in  turn  led  to  intensive  efforts 
to  circumvent  such  bottlenecks  through 
appropriate  advances  in  processor 
architecture,  multiprocessor  distributed 
tasking,  and  software-defined 
algorithms. 

An  alternative  strategy  that  may  yield 
significant computational enhancements
for certain broad classes of problems
involves the utilization of multidimensional
optical components
capable of modulating and/or redirecting
information-carrying light wavefronts.
Such an optical processing or
computing  approach  relies  for  its  com¬ 
petitive  advantage  principally  on  mas¬ 
sive  parallelism  in  conjunction  with 
relative  ease  of  implementation  of  com¬ 
plex  (weighted)  interconnections 
among  many  (perhaps  simple)  process¬ 


ing  elements.  A  wide  range  of  computa¬ 
tional  problems  exist  that  lend  them¬ 
selves  quite  naturally  to  optical 
processing  architectures,  including  pat¬ 
tern  recognition,  earth  resources  data 
acquisition  and  analysis,  texture  dis¬ 
crimination,  synthetic  aperture  radar 
(SAR)  image  formation,  radar  ambiguity 
function  generation,  spread  spectrum 
identification  and  analysis,  systolic 
array  processing,  phased  array  beam 
steering,  and  artificial  (robotic)  vision. 
In  addition,  many  neural  network  pro¬ 
cesses  that  inherently  rely  on  intricate 
interconnection  patterns  have  been  or 
can  be  implemented  optically.  These 
and  other  applications  are  treated  in 
more  detail  in  accompanying  articles  in 
this special issue of the MRS Bulletin1-3
and in special issues of IEEE Proceedings
and Optical Engineering on optical
computing.

A  generalized  optical  processor  or 
computer  can  be  depicted  schematically 
as  shown  in  Figure  1.  The  physical  con¬ 
stituent  elements  of  such  a  system 
include  a  central  processing  unit  (CPU) 
that performs the essential implementable
function, a data management
processor  that  orchestrates  the  flow  of 
data  and  sequence  of  operations  (usu¬ 
ally  considered  part  of  the  CPU  in  a  tradi¬ 
tional  electronic  computer),  several 
types  of  memory  elements  for  both 
short-term  and  long-term  data  storage 


and  buffering,  format  devices  to  spa¬ 
tially  organize  input  data  fields,  input 
devices  to  convert  data  input  types  to  a 
form  amenable  to  subsequent  process¬ 
ing,  output  devices  to  convert  pro¬ 
cessed  results  to  detectable  and  inter¬ 
pretable  forms,  and  detectors  to 
produce  externally  addressable  results. 
In  Figure  1,  feedback  interconnects  are 
explicitly  shown  as  separate  compo¬ 
nents  to  emphasize  their  crucial  role  in 
implementing  parallel  iterative 
algorithms  and  complex  weighting 
functions. 

A  wide  variety  of  optical  components 
are  required  to  implement  processors 
based  on  the  generalized  architecture 
shown in Figure 1.5 These include one-
and  two-dimensional  spatial  light  mod¬ 
ulators,  volume  holographic  optical  ele¬ 
ments,  threshold  arrays,  optical 
memory  elements,  sources,  source 
arrays,  detectors,  and  detector  arrays. 
The  state  of  the  technology  is  such  that 
while  demonstration  devices  and  proto¬ 
types  are  proliferating,  with  the  excep¬ 
tion  of  sources,  detectors  and  detector 
arrays,  few  (if  any)  such  components 
have  as  yet  achieved  significant  com¬ 
mercial  success  or  even  demonstrated 
technological  viability.  In  large  part,  this 
is  due  to  the  fact  that  each  of  the  candi¬ 
date  technologies  has  placed  rather 
severe  demands  on  the  state-of-the-art 
of  the  materials  employed  regardless  of 
the  nature  of  the  optical  effect  utilized 
(e.g., electrooptic, magnetooptic, photorefractive,
or electroabsorptive). In
other  words,  the  magnitudes  of  observ¬ 
able  optical  perturbations  per  unit  exci¬ 
tation  are  just  not  large  enough  with 
readily  available  materials  to  allow  flex¬ 
ible  device  engineering.  As  such,  the 
answer  to  whether  optical  processing 
and  computing  will  come  of  age  may 
ultimately  rest  on  the  capabilities  and 
fortunate  discoveries  of  materials  scien¬ 
tists  and  process  engineers. 

It  is  of  considerable  interest,  nonethe¬ 
less,  to  examine  the  physical  as  well  as 
technological  limitations  that  apply  to 
optical  information  processing  and  com¬ 
puting  systems  in  order  to  assess  their 
potential  performance  advantages  (if 
any)  over  comparable  electronic  coun¬ 
terparts,  and  to  provide  much-needed 
guidance  for  continued  research  efforts 
in  optical  materials,  devices,  algorithms, 
and  system  architectures.  Although  the 
study  of  fundamental  physical  limita¬ 
tions  in  the  context  of  digital  (binary) 
electronic  systems  (particularly  VLSI)  is 
well established,6,7 it should be noted
that comparable studies of optical processing
and computing systems are relatively
recent, and have to date focused
primarily on digital optical computing as
opposed to analog optical processing.8-11
The remainder of this article will
describe several such physical and technological
limitations of both optical
information processing and computing.
The following section addresses the
nature of computation from the perspective
of identifying physical (technology-independent)
limits. The next section
will then provide an example of a
technology-dependent limit in the context of
photorefractive volume holographic
optical interconnections. Conclusions
are drawn in a final section.

Figure 1. Schematic diagram of the elements of a generalized optical processor or computer
(after Ref. 5).


The  Nature  of  Computational 
Constraints 

In  order  to  implement  any  computa¬ 
tional  algorithm  on  a  machine  of  given 
architecture  and  component  hardware, 
we  must  first  choose  a  representation  for 
inputting  data  and  executing  computa¬ 
tional  steps.  Such  a  representation 
might,  for  example,  be  digital,  analog, 
or  even  symbolic.  As  we  shall  see,  such 
a  choice  of  representation  has  profound 
implications  on  the  physical  limits  that 
apply  to  subsequent  computation 


For  a  given  representation,  we  can 
then  assess  the  computational  complexity 
of  the  chosen  algorithms.  We  define  the 
computational complexity of an operation
as the equivalent number of irreducible
binary switching operations
required  to  perform  the  same  operation 
in  the  most  efficient  manner  possible. 
For  example,  the  relatively  simple 
operation  of  transferring  1,000  ten-bit 
words  from  the  CPU  to  memory  will 
require at the very least 10^4 binary
switching  operations  (assuming  that  all 
of  the  data  is  transferred  in  parallel),  and 
most  likely  considerably  more  if  shift 
registers  are  employed  to  multiplex  the 
data  transfer.  This  definition  of  compu¬ 
tational  complexity  provides  a  conve¬ 
nient  means  for  comparing  the  overall 
efficiency  or  total  minimum  energy  cost 
of  a  given  computation  performed  by 
different  algorithms  on  a  machine  of 
given  architecture  and  technology,  or 
by  machines  based  on  vastly  different 
architectures  and  perhaps  disparate 
technological  hardware.  It  does  not, 
however,  take  into  account  the  inter¬ 
connection  complexity  required  to 
implement  communications  pathways 
at  the  device,  circuit,  and  subsystem 
levels. 


Finally,  we  must  end  each  computa¬ 
tion  with  a  detection  of  the  desired 
results,  in  order  to  allow  the  result  to  be 
used  (for  example,  to  implement  a 
desired  action).  This  separation  of  every 
computational  process  into  the  imple¬ 
mentation  of  representation,  computa¬ 
tional  complexity,  and  detection  func¬ 
tions  allows  us  to  establish  sets  of 
interrelated  limits  that  apply  to  various 
combinations  of  choices. 

For  purposes  of  discussion  in  this 
article,  let  us  examine  each  of  these 
functions  from  the  point  of  view  of 
energy  cost.  In  so  doing,  we  seek  to 
identify  physical  limits  on  the  maximum 
computational  throughput  achievable 
per  watt  of  dissipated  power.  It  should 
perhaps  be  noted  here  that  the  "require¬ 
ment"  of  energy  dissipation  is  not  in 
fact  a  fundamental  limit,  and  derives 
instead  from  a  system  design  demand 
to  assert  each  stored  value  as  rapidly  as 
possible  in  order  to  complete  the  entire 
computation  deterministically  and  in 
minimum time. Thus adiabatic computational
processes which do not require
a minimum energy dissipation12 are not
applicable  to  the  present  discussion. 

In  the  digital  (binary)  realm,  the  low¬ 
est  possible  energy  cost  of  representing 
a number requiring a given number of
bits  is  equal  to  the  number  of  bits  times 
the  minimum  energy  to  store  a  single 
bit.  For  semiconductor  electronic 
switches,  the  minimum  energy  in  turn 
is  equal  to  a  few  tens  of  kBT  per  elec¬ 
tron6"  times  the  number  of  electrons 
required  to  guarantee  detection  with  a 
given  bit  error  rate  (BER)  [a  quantum 
rather than a thermal limit]. For a bit
error rate of 10^-9 or so, about ten electrons
are required (assuming an "ideal"
detector [or following switch] that can
unambiguously  differentiate  between 
the  presence  and  absence  of  a  single 
electron).  Therefore  each  switching 
event  (or  representation  of  each  bit) 
requires the dissipation of approximately
200 k_BT. This places an immediate
upper boundary on the maximum
computational  throughput  of  such  an 
electronic  digital  (binary)  computing 
engine of approximately 10^18 transitions
(irreducible  binary  switching  opera¬ 
tions)  per  second  per  watt  at  room  tem¬ 
perature.  Note  that  this  bound  does  not 
include  the  assessment  of  any  cost  for 
internal  communication  of  information, 
as  would  be  required  to  execute  an 
actual  computation. 
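
The arithmetic behind this bound can be reproduced in a few lines. The sketch assumes room temperature to mean 300 K and takes "a few tens of k_BT" to be 20 k_BT per electron, so that ten electrons correspond to the 200 k_BT quoted above; both values and the function names are illustrative assumptions.

```python
K_B = 1.380649e-23  # Boltzmann constant (J/K)


def switching_energy(electrons=10, kt_per_electron=20.0, temperature=300.0):
    """Energy dissipated per irreducible binary switching event (J)."""
    return electrons * kt_per_electron * K_B * temperature


def max_transitions_per_second_per_watt(**kwargs):
    """Upper bound on throughput: one joule divided by the switching energy."""
    return 1.0 / switching_energy(**kwargs)


if __name__ == "__main__":
    print(f"{switching_energy():.2e} J per transition")
    print(f"{max_transitions_per_second_per_watt():.2e} transitions/s/W")
```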

To  place  this  number  in  proper  per¬ 
spective,  we  need  only  look  at  the 
switching  energies  representative  of 
current  semiconductor  technology.  For 
electronic  devices,  we  might  examine  a 
typical CMOS capacitor with a 100 µm²
cross-sectional area and a 1,000 Å oxide
thickness, for which the charge-discharge
cycle consumes CV² worth of
energy. If we operate at the minimum
switching voltage of a few tens of
k_BT/q, the switching energy is about
10^6 k_BT at room temperature, about four
orders of magnitude above the fundamental
limit. Current digital logic circuits
operate at roughly 2.5 × 10^-12 J/
transition, which is about 10^9 k_BT. This
is also roughly the minimum energy
required to operate even a local
interconnection at GHz rates, an unavoidable
cost of communication between
devices at the circuit level. At the system
level, a vast amount of additional
overhead comes into play. For example,
the DEC VAX 11-250 dissipates approximately
3 kW running fully loaded at its
maximum throughput rate of 750,000
instructions/second (200,000 operations/second).
Thus the energy required
to perform a single instruction is about
4 mJ or 10^18 k_BT, which corresponds to a
throughput rate of 250 instructions/
second/watt.
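
These estimates can be reproduced with a short calculation. The sketch assumes a silicon dioxide gate dielectric with relative permittivity 3.9, a switching voltage of 20 k_BT/q, and a temperature of 300 K; none of these specific values is stated in the text, and the function names are illustrative. The system-level figure is obtained by direct division of the quoted power by the quoted instruction rate.

```python
EPS0 = 8.8541878128e-12  # vacuum permittivity (F/m)
K_B = 1.380649e-23       # Boltzmann constant (J/K)
Q_E = 1.602176634e-19    # elementary charge (C)


def capacitor_switching_energy(area_m2, oxide_m, kt_over_q_multiples=20.0,
                               temperature=300.0, rel_permittivity=3.9):
    """C*V**2 for a parallel-plate oxide capacitor switched at a multiple of kT/q."""
    capacitance = EPS0 * rel_permittivity * area_m2 / oxide_m
    voltage = kt_over_q_multiples * K_B * temperature / Q_E
    return capacitance * voltage**2


def in_units_of_kt(energy_j, temperature=300.0):
    """Express an energy in units of k_B*T."""
    return energy_j / (K_B * temperature)


def energy_per_instruction(power_w, instructions_per_s):
    """System-level energy cost of a single instruction (J)."""
    return power_w / instructions_per_s


if __name__ == "__main__":
    device = capacitor_switching_energy(100e-12, 1000e-10)  # 100 um^2, 1,000 A
    system = energy_per_instruction(3e3, 750e3)             # 3 kW, 750,000 inst/s
    print(f"capacitor: {in_units_of_kt(device):.1e} kT")
    print(f"system: {system * 1e3:.1f} mJ = {in_units_of_kt(system):.1e} kT")
```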

By comparison, analog representations
(as used extensively, for example,
in optical processors) require far more
energy than the binary equivalent. This
is due to the necessity of utilizing a
much higher particle count (electrons or
photons) in order to minimize the
effects of quantum statistical fluctuations
on the BER. For example, if we
wish an analog representation of the
number 1,000, then we require a
dynamic range of at least 1,000:1. For
incoherent illumination, quantum fluctuations
in the emission/detection process
produce a photon number distribution
with a relative standard deviation
of σ ≈ √N/N. The equivalent of a 10^-9
BER for the digital case corresponds to
roughly 12 standard deviations. Therefore,
the number of photons required
must be greater than 1.5 × 10^8 from statistical
considerations alone. For a GaAs
semiconductor laser characterized by a
photon energy of ~1.5 eV, this corresponds
to about 10^10 k_BT. To represent
1,000 optically in binary requires
approximately 14 bits (10 bits for the
number plus 4 bits of overhead) at 15 eV
each (10 photons at 1.5 eV each, assuming
direct detection and an ideal detector),
or about 10^4 k_BT.
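
The comparison can be checked numerically. The sketch reads the statistical requirement as follows: adjacent analog levels, separated by N divided by the dynamic range, must differ by 12 standard deviations of √N, which gives N = (12 × dynamic range)². A temperature of 300 K is assumed, and the function names are illustrative.

```python
K_B_EV = 8.617333262e-5  # Boltzmann constant (eV/K)


def analog_photons(dynamic_range, sigmas=12.0):
    """Photons needed so that one level step spans `sigmas` standard deviations.

    Solves N / dynamic_range = sigmas * sqrt(N) for N.
    """
    return (sigmas * dynamic_range) ** 2


def photon_energy_in_kt(photons, photon_ev=1.5, temperature=300.0):
    """Total optical energy in units of k_B*T."""
    return photons * photon_ev / (K_B_EV * temperature)


def binary_energy_in_kt(bits=14, photons_per_bit=10, photon_ev=1.5,
                        temperature=300.0):
    """Energy of a binary optical representation in units of k_B*T."""
    return photon_energy_in_kt(bits * photons_per_bit, photon_ev, temperature)


if __name__ == "__main__":
    n_analog = analog_photons(1000)
    print(f"analog: {n_analog:.2e} photons, {photon_energy_in_kt(n_analog):.1e} kT")
    print(f"binary: {binary_energy_in_kt():.1e} kT")
```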

Given  the  remarkably  higher  cost  of 
analog  representation  as  compared  with 
digital,  any  competitive  advantages  for 
analog  systems  from  the  perspective  of 
an  energy  dissipation  metric  due  to 
physical  limits  must  come  from 
enhanced  computational  complexity. 
The  various  tradeoffs  involved  can  per¬ 
haps  best  be  described  in  terms  of  an 
example.  Consider,  then,  a  highly  inter¬ 
connected,  nonlocal  problem  such  as 
the  Fourier  transformation  of  a  two- 
dimensional  function.  Assume  that  the 
function  is  sampled  on  a  1,000  x  1,000 
element  array  at  10  bits. 

If  we  operate  in  the  binary  regime,  we 
can  utilize  a  highly  efficient  discrete 
Fourier  transform  (DFT)  routine  such  as 
the Cooley-Tukey algorithm. The number
of complex operations (multiplies
and adds) scales as $1.5N^2\log_2 N^2$ for an
$N \times N$ input. If we assume an electronic
machine  implementation  that  requires 
20 switching events/bit/complex operation,
then approximately $10^{10}$ switching
events are required to perform the computation,
or 10 nJ for operation near the
physical limits described above.
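
The operation count behind this estimate can be reproduced directly. The following is a minimal sketch that simply evaluates the scaling expression $1.5N^2\log_2 N^2$ for the 1,000 x 1,000, 10-bit example; the function and constant names are illustrative only.

```python
import math


def fft2_complex_ops(n):
    """Complex operations for an n x n transform, using 1.5 * n^2 * log2(n^2)."""
    return 1.5 * n**2 * math.log2(n**2)


N = 1000                              # samples per side of the input array
BITS_PER_SAMPLE = 10                  # quantization of each sample
SWITCHING_EVENTS_PER_BIT_PER_OP = 20  # figure assumed in the text

ops = fft2_complex_ops(N)
switching_events = ops * BITS_PER_SAMPLE * SWITCHING_EVENTS_PER_BIT_PER_OP

print(f"complex operations: {ops:.2e}")
print(f"switching events:   {switching_events:.2e}")
```

The result is about $3 \times 10^{7}$ complex operations and $6 \times 10^{9}$ switching events, which rounds to the order-of-magnitude figure of $10^{10}$ used in the text.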

If, on the other hand, the input data
field  (two-dimensional  function)  is  rep¬ 
resented  in  analog  form  by  means  of  a 
spatial  light  modulator,  illumination  by 
a  coherent  wavefront  will  produce  the 
required  transform  in  the  back  focal 
plane of a (following) lens. Here we see
illustrated the notion of a computation
as the transformation of information. The
energy  cost  of  this  particular  operation 
accrues  only  to  the  initial  representation 
of  the  function  (essentially,  the  optical 
or  electronic  addressing  of  the  spatial 
light  modulator)  and  to  the  subsequent 
detection  of  the  final  result.  This  is  then 
equivalent to $2 \times 10^{6}$ individual pixel
detection operations for the optically
addressed case, or 70 µJ at the physical
limits for 1.5 eV photons, still about four
orders  of  magnitude  larger  than  the 
binary  equivalent.  The  computational 
complexity  implemented  by  the  analog 
optical  processor  is  thus  seen  to  par¬ 
tially  offset  the  large  initial  difference  in 
representation  energy  costs.  For  other 
classes  of  problems  with  even  higher 
inherent  computational  complexity,  the 
physical  boundary  on  energy  cost  may 
thus  favor  an  analog  approach. 
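
As a rough check on the optical figure, the sketch below multiplies the number of pixel operations by an assumed per-pixel photon count and compares the result with the 10 nJ binary estimate. The per-pixel photon count of $(12 \times 1000)^2$ is an assumption carried over from the statistical argument for a 1,000:1 dynamic range; the names are illustrative.

```python
import math

EV_TO_J = 1.602176634e-19

PHOTON_EV = 1.5                       # photon energy used in the text
PHOTONS_PER_PIXEL = (12 * 1000) ** 2  # ASSUMPTION: 12 sigma at 1,000:1 dynamic range
PIXEL_OPERATIONS = 2 * 1000 * 1000    # one input plus one output plane of 10^6 pixels

optical_energy_j = PIXEL_OPERATIONS * PHOTONS_PER_PIXEL * PHOTON_EV * EV_TO_J

DIGITAL_LIMIT_J = 10e-9               # binary DFT estimate quoted in the text
orders_of_magnitude = math.log10(optical_energy_j / DIGITAL_LIMIT_J)

print(f"optical transform: {optical_energy_j:.2e} J")
print(f"excess over binary: {orders_of_magnitude:.1f} orders of magnitude")
```

With these inputs the optical transform costs close to $7 \times 10^{-5}$ J, a little under four orders of magnitude above the binary estimate, compared with the six orders of magnitude that separate the two representations of a single number.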

The  situation  looks  quite  a  bit  differ¬ 
ent  if  we  examine  current  technological 
(rather  than  more  fundamental)  con¬ 
straints.  For  current  digital  circuits  that 
switch at around 2.5 pJ/bit, the DFT just
described dissipates 25 mJ for the
required number of switching operations
alone. Current digital systems
such as the VAX 11-750 consume about
$10^{-4}$ J/bit, or 1 MJ for the DFT. With
regard  to  optical  systems,  spatial  light 
modulators are available with input sensitivities
of about 200 pJ/pixel at high
signal-to-noise ratio, and CCD detector
arrays dissipate approximately 1 mJ/Mpixel
on readout. Hence the analog
optical Fourier transform can be performed
for an energy cost of about 1.2
mJ. This apparent capability of analog
optical  systems  to  generate  significantly 
enhanced  computational  throughput 
per  unit  input  power  is  thus  due  largely 
to  the  fact  that  currently  available  ana¬ 
log  optical  components  operate  far 
closer  to  the  relevant  thermal  and  quan¬ 
tum  limits  than  current  digital  electronic 
(VLSI)  components.  It  should  be  noted 
that  such  a  comparison  begs  the  ques¬ 
tion  of  overall  computational  accuracy, 
which  clearly  favors  the  digital  imple¬ 
mentations due to nonlinearities and
nonuniformities  in  currently  available 
analog  optical  components. 
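
The technology-limited comparison reduces to a few multiplications, sketched below with the rounded count of $10^{10}$ switching events and the component figures quoted in the text. Variable names are illustrative.

```python
SWITCHING_EVENTS = 1e10          # rounded count for the 1,000 x 1,000 DFT

# Digital electronics at current technology levels.
CIRCUIT_J_PER_EVENT = 2.5e-12    # circuit-level switching energy
SYSTEM_J_PER_EVENT = 1e-4        # system-level energy per bit (VAX-class machine)
digital_circuit_j = SWITCHING_EVENTS * CIRCUIT_J_PER_EVENT
digital_system_j = SWITCHING_EVENTS * SYSTEM_J_PER_EVENT

# Analog optical Fourier transform at current technology levels.
PIXELS = 1000 * 1000
SLM_J_PER_PIXEL = 200e-12        # spatial light modulator input sensitivity
CCD_J_PER_MPIXEL = 1e-3          # detector array readout dissipation
slm_j = PIXELS * SLM_J_PER_PIXEL
ccd_j = (PIXELS / 1e6) * CCD_J_PER_MPIXEL
optical_j = slm_j + ccd_j

print(f"digital circuits: {digital_circuit_j * 1e3:.0f} mJ")
print(f"digital system:   {digital_system_j / 1e6:.0f} MJ")
print(f"analog optical:   {optical_j * 1e3:.1f} mJ")
```

The optical total is dominated by the detector readout (1 mJ) rather than by the modulator input (0.2 mJ).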

The  third  principal  component  of  the 
computational  process,  that  of  detection 
of  the  results  (whether  electronic  or 
optical),  is  subject  to  the  same  thermal 
and  quantum  statistical  limitations  as 
the  process  of  representation.  This 
statement  follows  directly  from  an 
assumption  inherent  in  assessing  the 
minimum  energy  required  for  represen¬ 
tation —  that  each  number  must  be 
capable  of  detection  at  a  given  BER, 
assuming  an  ideal  detector.  Technologi¬ 
cal  constraints  can  then  be  added  to  the 
fundamental  limit  problem  as  before  to 
assess  realistic  current  capabilities.  For 
example,  available  optical  detectors 
typically  require  a  mean  photon  number 
of 1,000 for a $10^{-9}$ BER in the direct
detection  mode,  rather  than  the  value  of 
10  assumed  above. 

Volume  Holographic  Optical 
Interconnections 

As  pointed  out  in  the  Introduction 
(and  discussed  in  detail  in  Ref.  1),  the 
ability  to  provide  complex,  multidimen¬ 
sional,  programmable,  weighted  inter¬ 
connections  by  means  of  volume  holo¬ 
grams  is  an  attractive  feature  of  optical 
processing  and  computing  systems.  In 
addition,  this  ability  perhaps  more  so 
than  any  other  contributes  significant 
computational  complexity  to  optical 
architectures.  To  this  end,  it  is  of  con¬ 
siderable  importance  to  assess  the  fun¬ 
damental  limitations  that  apply  to  such 
volume  holographic  optical  intercon¬ 
nections. 

Although  many  types  of  photosensi¬ 
tive  media  can  be  utilized  for  the  record¬ 
ing  and  readout  of  volume  holograms, 
the  dynamic  reprogrammability  offered 
by  photorefractive  crystals  such  as 
lithium  niobate,  barium  titanate,  bis¬ 
muth  silicon  oxide,  and  gallium 
arsenide  has  made  such  materials  the 
objects  of  intensive  study.  In  these 
materials, the interference between two
coherent optical wavefronts (the signal
and reference beams) generates a space-variant
photoexcitation distribution that
produces  in  turn  a  related  space  charge 
redistribution  and  associated  electric 
field  pattern  by  carrier  drift  and  diffu¬ 
sion.  Many  interesting  limits  apply  to 
such  a  process,  including  the  maximum 
number  of  independent  interconnec¬ 
tions  that  can  be  sequentially  or  simulta¬ 
neously  recorded  at  a  given  value  of 
allowable  crosstalk,  the  highest  achiev¬ 
able  diffraction  efficiency  of  each  inde¬ 
pendent  interconnection  at  the  maxi¬ 
mum  interconnection  density,  the 
absolute  minimum  number  of  photoev¬ 
ents  per  unit  volume  required  to  record 
an  interconnection  of  given  analog 
weight  within  the  quantum  fluctuation 
limits  for  statistical  accuracy  of  record¬ 
ing  and  reconstruction  (readout),  and 
the  maximum  asymmetry  possible 
between  the  recording  and  erasure  pro¬ 
cesses.15 An additional limit of consider¬
able  importance  is  that  of  the  photore¬ 
fractive  sensitivity,  or  the  refractive 
index  modulation  obtained  in  recording 
a uniform grating of fixed spatial frequency
per unit (incident or absorbed)
energy  density,  as  this  parameter  places 
an  upper  bound  on  the  maximum  rate 
of  interconnection  reprogrammings  that 
can  be  accomplished  per  unit  average 
power.  In  what  follows,  the  fundamen¬ 
tal  physical  limitations  on  the  photore¬ 
fractive  sensitivity  are  examined  in  a  bit 
more  detail. 

A  number  of  factors  contribute  to  the 
photorefractive sensitivities characteristic
of photoconductive, electrooptic
materials. One such factor is the photogeneration
quantum efficiency, which
represents the number of photogenerated
mobile charge carriers per photon
absorbed from the recording beam(s). A
second factor is the charge transport
efficiency, which is a measure of the
degree to which the average photogenerated
mobile charge carrier contributes
to the forming space charge grating
after separation from its original site by
means of drift and/or diffusion and subsequent
trapping. The magnitude of the
space  charge  field  generated  by  a  given 
space  charge  grating  is  inversely  pro¬ 
portional  to  the  dielectric  permittivity  e 
of  the  photorefractive  material,  which 
thus  contributes  a  third  factor  to  the 
grating  recording  sensitivity.  A  fourth 
factor  describes  the  perturbation  of  the 
local  index  ellipsoid  (dielectric  tensor  at 
optical frequencies) that results from a
given space charge field through the
electrooptic (Pockels or Kerr) effect.

In  addition,  several  other  physical 
quantities  factor  into  an  evaluation  of 
the  photorefractive  sensitivity,  includ¬ 
ing  the  wavelength  of  the  recording  illu¬ 
mination  (to  convert  the  number  of 
absorbed  photons  into  an  equivalent 
energy),  the  absorption  coefficients  of 
the  material  at  the  wavelengths  of  both 
the  recording  and  readout  beams  (to 
correct  for  the  fractional  absorbance  of 
the  recording  beams  and  the  fractional 
transmittance  of  the  readout  beam),  and 
the  magnitude  of  the  applied  voltage 
(which  significantly  alters  the  sensitivity 
characteristics  for  certain  materials  by 
changing  the  nature  of  the  dominant 
charge  transport  mechanism  from  the 
diffusion  regime  to  the  drift  regime). 

In  order  to  provide  a  quantitative 
metric  that  does  in  fact  have  a  fun¬ 
damental  physical  limitation,  we  define 
the  grating  recording  efficiency :®  of  a  pho- 
torefractive  recording  model  or  configu¬ 
ration  as  the  magnitude  of  the  space 
charge  field  produced  by  a  fixed  num¬ 
ber  of  photogenerated  mobile  charge 
carriers at a given spatial frequency,
normalized by the maximum quantum
limited  space  charge  field  that  can  be 
produced  by  the  same  number  of  photo¬ 
generated  carriers.  This  metric  thus 
effectively  combines  the  notions  of  a 
photogeneration  efficiency  and  a  charge 
transport  efficiency,  and  provides  an 
estimate  of  the  fundamental  quantum 
efficiency  of  the  photorefractive  grating 
recording  process.  For  simplicity,  we 
confine  our  attention  here  to  a  photore¬ 
fractive model characterized by a single
mobile charge species, and a single type
of  donor  site  with  associated  un-ionized 
donor  and  ionized  donor  (trap)  states. 

Let  us  compare  the  grating  recording 
efficiencies  of  four  idealized  grating 
recording  models,  generated  by 
combining  two  types  of  photogenera¬ 
tion  profiles  (a  comb  function  and  a 
sinusoid)  with  two  types  of  charge 
transport  processes  (a  translation  of 
each photogenerated charge carrier by
exactly half of a grating wavelength;
and  uniform  redistribution,  or  random¬ 
ization,  of  all  photogenerated  charge). 
The  resulting  four  combinations  are 
shown  schematically  in  Figure  2. 

The  maximum  quantum  limited  space 
charge  field  that  can  result  from  a  fixed 
number  of  photogenerated  mobile 
charges  is  given  by  the  bipolar  comb 
distribution,  which  generates  a  square 
wave  electric  field  profile  from  alternat¬ 
ing  positive  and  negative  sheets  of 
charge  spaced  by  half  of  a  grating  wave¬ 
length.  It  is  in  fact  the  first  harmonic 
component  of  this  electric  field  profile 
that  we  are  interested  in,  as  we  assume 
a  volume  grating  operated  deep  within 
the  Bragg  diffraction  regime.  The  bipo¬ 
lar  comb  distribution  can  thus  be 
assigned  a  grating  recording  efficiency 
of  unity. 

By comparison, the monopolar comb,
transport-efficient sinusoid, and baseline
sinusoid cases yield grating recording
efficiencies of 1/2, 1/2, and 1/4,
respectively. These results are summarized
in Figure 3, which shows the evo¬
lution  of  the  first  harmonic  component 
of  the  space  charge  field  for  all  four 
cases  as  a  function  of  grating  recording 
time. 
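
These relative efficiencies can be reproduced with a small numerical model. The sketch below is an idealized illustration, not the analysis of the cited references: it places one unit of photogenerated charge in a single grating period, leaves the ionized donors where the carriers were generated, moves the carriers either by half a period or into a uniform background, and compares the first Fourier harmonic of the net charge. Because all four cases share the same grating wavevector and permittivity, Gauss's law makes the first harmonic of the field proportional to that of the charge. The mapping of case names to profile and transport combinations is inferred from the text, and all function names are illustrative.

```python
import cmath
import math

M = 1024  # samples per grating period (even, so a half-period shift is exact)


def comb_profile():
    """All photogeneration concentrated in one sheet per period (total = 1)."""
    g = [0.0] * M
    g[0] = 1.0
    return g


def sinusoid_profile():
    """Photogeneration proportional to 1 + cos(Kx), normalized to total = 1."""
    return [(1.0 + math.cos(2.0 * math.pi * i / M)) / M for i in range(M)]


def shift_half_period(g):
    """Every carrier is translated by exactly half a grating wavelength."""
    return [g[(i - M // 2) % M] for i in range(M)]


def spread_uniformly(g):
    """Carriers are randomized into a uniform background."""
    total = sum(g)
    return [total / M] * M


def first_harmonic(rho):
    """Magnitude of the first Fourier harmonic of a periodic charge profile."""
    return abs(sum(r * cmath.exp(-2j * math.pi * i / M) for i, r in enumerate(rho)))


def net_charge(generation, transport):
    """Ionized donors stay in place; the moved carriers carry opposite charge."""
    moved = transport(generation)
    return [donor - carrier for donor, carrier in zip(generation, moved)]


cases = {
    "bipolar comb": (comb_profile, shift_half_period),
    "monopolar comb": (comb_profile, spread_uniformly),
    "transport-efficient sinusoid": (sinusoid_profile, shift_half_period),
    "baseline sinusoid": (sinusoid_profile, spread_uniformly),
}

reference = first_harmonic(net_charge(comb_profile(), shift_half_period))
efficiency = {
    name: first_harmonic(net_charge(profile(), transport)) / reference
    for name, (profile, transport) in cases.items()
}

for name, value in efficiency.items():
    print(f"{name}: {value:.3f}")
```

Normalizing to the bipolar comb gives 1, 1/2, 1/2, and 1/4 for the four cases in this simplified model.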

The  saturation  behavior  of  each  case 
derives  from  assuming  a  fixed  finite  trap 
density prior to grating recording. Since
the  resultant  diffraction  efficiencies  of 
such  gratings  scale  as  the  square  of  the 
space  charge  field  (for  small  modula¬ 
tion),  these  results  imply  that  the  base¬ 
line  sinusoid  distribution  will  exhibit  a 
diffraction  efficiency  lower  than  the 
bipolar comb distribution by a factor of
16.  This  conclusion  is  of  considerable 
Figure 2. Schematic diagram depicting
the illumination profile and charge
transport assumptions underlying each of
the four idealized photorefractive
recording models discussed in the text.


Figure 3. The evolution of the space
charge field as a function of grating
recording time (arbitrary units) for each
of the four idealized photorefractive recording
models. The normalization parameter
$E_q = eN_A/(\epsilon\epsilon_0 K_G)$, in which $N_A$ is the
maximum available trap density and
$K_G$ is the grating wavevector.


interest  because  the  baseline  sinusoid 
case  can  be  shown1’  to  represent  an 
asymptotic  upper  bound  on  the  grating 
recording  efficiency  predicted  by  more 
realistic  (rather  than  idealized)  photore¬ 
fractive  grating  recording  models  that 
assume  a  sinusoidal  illumination  pro¬ 
file.  In  fact,  several  photorefractive 
materials  have  been  demonstrated  to 
exhibit  photorefractive  sensitivities  that 
approach  the  upper  bound  represented 
by  the  baseline  sinusoid  case.18 

Nonetheless,  the  photorefractive 
grating recording process is from this
perspective somewhat quantum ineffi¬
cient,  in  that  it  does  not  make  the  best 
possible  use  of  either  input  photons  or 
photogenerated  carriers.  In  view  of  the 
analysis  presented  above,  therefore,  it  is 
of  considerable  interest  to  imagine 
novel  engineered  photorefractive  mate¬ 
rials  with  donor  planes  spatially  sepa¬ 
rated  from  trap  planes,  illuminated  by  a 
comb-like  intensity  pattern  (which  can 
be  generated,  for  example,  by  utilizing 
stratified  volume  holographic  optical 
elements  [SVHOEs]20).  Such  materials 
could  find  applications  in  devices  based 
on  generation  of  a  carrier  grating  of 
given  spatial  frequency,  such  as  the 
Photorefractive  Incoherent-to-Coherent 
Optical  Converter  (PICOC).21 

Furthermore,  it  should  be  noted  that 
the  limits  derived  for  interconnections 
based  on  photorefractive  materials  per¬ 
tain  to  a  particular  physical  mechanism 
involving photoexcitation, charge transport,
and electrooptically induced index
changes.  The  importance  of  multiplexed 
interconnections  to  overall  schemes  for 
optical  processing  and  computing  sug¬ 
gests  continued  emphasis  on  the  search 
for  alternative  physical  effects  that  can 
provide  either  enhanced  sensitivity  or 
more  conveniently  implementable 
desirable  features  such  as  selective  era¬ 
sure.3 

Conclusions 

An  examination  of  both  the  fun¬ 
damental  physical  and  current  techno¬ 
logical  limitations  to  computational  per¬ 
formance  shows  that  in  terms  of 
throughput  per  unit  power,  digital 
(binary)  representations  (whether  elec¬ 
tronic  or  optical)  presently  operate 
many  orders  of  magnitude  away  from 
the  relevant  boundaries.  Analog  repre¬ 
sentations  and  detections  are  inherently 
much  more  power  consumptive  than 
digital  representations  and  detections, 
and  processing  schemes  based  on  ana¬ 
log  algorithms  must  demonstrate  con¬ 
siderably  enhanced  computational  com¬ 
plexity  prior  to  each  intermediate 
detection  plane  in  order  to  be  energy 
competitive  at  the  appropriate  thermal 
and  quantum  limits. 

Hybrid  architectures  and  algorithms 
(such  as  those  employed  by  numerous 
neural  network  models)  that  effectively 
combine  analog  representations  as 
weights  in  highly  multiplexed  parallel 
interconnections  with  binary  represen¬ 
tations  in  the  form  of  threshold  arrays 
(decision  planes)  may  ultimately  prove 
to  be  a  nearly  optimum  compromise. 
This  is  particularly  applicable  to  optical 
processing and computing architectures,
for which available analog components
(spatial light modulators and
photorefractive volume holographic
optical elements) operate much closer to
the  relevant  quantum  statistical  limits 
than  the  corresponding  digital  compo¬ 
nents  (threshold  arrays).  In  such  hybrid 
architectures,  the  threshold  array  per¬ 
forms  the  equivalent  of  a  parallel  level 
restore  function  (saturated  nonlinearity) 
that  is  essential  to  the  proper  conver¬ 
gence  of  many  processing  and  comput¬ 
ing  algorithms. 

Further  study  of  the  fundamental 
physical  limitations  that  apply  to  optical 
processing  and  computing  will  provide 
important  guidance  for  the  continued 
development  of  active  optical  materials, 
primarily  by  differentiating  between 
avenues  of  opportunity  with  large 
potential  gain  that  depend  on  key  mate¬ 
rials  parameters,  and  those  for  which 
current  materials  characteristics  are  suf¬ 
ficient  to  approach  relevant  perfor¬ 
mance  boundaries. 

Acknowledgments 

Research  on  the  fundamental  limita¬ 
tions  of  optical  information  processing 
and  computing  at  the  University  of 
Southern  California  is  supported  in  part 
by  the  Air  Force  Office  of  Scientific 
Research,  the  Defense  Advanced 
Research  Projects  Agency,  and  the 
Army  Research  Office. 

References 

1. D.Z. Anderson, MRS Bull. (this issue).

2. A.M. Glass, MRS Bull. (this issue).

3. J.A. Neff, ed., Opt. Eng., Special Issue on
Optical Computing 24 (1) (1985).

4. H.J. Caulfield, S. Horvitz, G.P. Tricoles,
and W.A. Von Winkle, eds., Proc. IEEE, Special
Issue on Optical Computing 72 (7) (1984).

5. A.R. Tanguay Jr., Opt. Eng. 24 (1) (1985)
p. 2-18.

6. C. Mead and L. Conway, Introduction to
VLSI Systems (Addison-Wesley, Reading,
MA, 1980) p. 333-372.

7. R.W. Keyes, The Physics of VLSI Systems
(Addison-Wesley, Reading, MA, 1987).

8. R.W. Keyes, Proc. IEEE 63 (5) (1985)
p. 740-767.

9. P.W. Smith, Bell Systems Tech. J. 61 (1982)
p. 1975-1993.

10. R.L. Fork, Phys. Rev. A 26 (4) (1982)
p. 2049-2064.

11. R.W. Keyes, Opt. Acta 32 (5) (1985)
p. 525-535.

12. R. Landauer, in Optical Information Processing,
edited by Yu. E. Nesterikhin, G.W.
Stroke, and W.E. Kock (Plenum Press, New
York, 1976) p. 219-253.

13. C.H. Bennett, Int. J. Theor. Phys. 21 (12)
(1982) p. 905-941.

14. C. Kyriakakis, P. Asthana, R.V. Johnson,
and A.R. Tanguay Jr., Proc. Opt. Soc. Am.
Topical Meeting on Spatial Light Modulators,
Lake Tahoe, Nevada (1988).

15. See Reference 1 for a discussion of the
usefulness of this asymmetry.

16. A.M. Glass and D. von der Linde, Ferroelectrics
10 (1976) p. 163-166.

17. F. Micheron, Ferroelectrics 18 (1978)
p. 153-159.

18. A.M. Glass, Opt. Eng. 17 (5) (1978)
p. 470-479.

19. R.V. Johnson and A.R. Tanguay Jr., in
Optical Processing and Computing, edited by
H. Arsenault and T. Szoplik (Academic
Press, New York, 1988), (in press).

20. R.V. Johnson and A.R. Tanguay Jr., Opt.
Lett. 13 (3) (1988) p. 189-191.

21. A. Marrakchi, A.R. Tanguay Jr., J. Yu,
and D. Psaltis, Opt. Eng. 24 (1) (1985)
p. 124-131.

22. D. von der Linde and A.M. Glass, Appl.
Phys. 8 (1975) p. 85-100.


MRS BULLETIN/AUGUST 1988

Physical and Technological Limitations of Optical Information Processing and Computing


APPENDIX 5

OPTICS  IN  1988 

Binary  optics  correction  of  typical  lenses. 


ly  add  the  power  of  laser  diode  arrays  (modular  laser  pow¬ 
er),  2)  beam  steering  or  wavefront  manipulation,  and  3) 
opto-electronic  integration  of  imager  focal  planes. 

Thus  far,  work  has  concentrated  on  the  technology  in¬ 
volved  in  designing  and  producing  individual  elements  for 
systems.  The  vast  potential  of  binary  optics  to  spawn  new 
products  and  industries  will  materialize  only  as  industrial 
expertise  develops,  system  designers  become  familiar  with 
its  cost,  weight,  and  design  benefits,  and  manufacturers 
put  it  in  production  through  replication,  embossing,  and 
forging  or  molding  from  a  single  master  element. 


STRATIFIED VOLUME HOLOGRAPHIC
OPTICAL ELEMENTS

R.V. Johnson and A.R. Tanguay Jr.

Optical  Materials  and  Devices  Laboratory,  and 
Center  for  Photonic  Technology 
University  of  Southern  California 
Los  Angeles,  Calif. 

Holographic  optical  elements  are  of  fundamental  im¬ 
portance  to  a  number  of  applications  in  optical  in¬ 
formation  processing  and  computing,  including  optical  in¬ 
terconnections,  content-addressable  memories,  and  vari¬ 
ous  linear  and  nonlinear  signal  processing  configurations. 
Numerical  analyses  of  typical  volume  holographic  struc¬ 
tures  are  critical  for  characterizing  and  optimizing  such 
optical  elements.  One  of  the  most  flexible  and  broadly  ap¬ 
plicable  tools  for  studying  the  diffraction  behavior  of  vol¬ 
ume  holograms  is  the  optical  beam  propagation  method 
(BPM).1,2  In  this  method,  the  distributed  optical  inhomo¬ 
geneities  that  characterize  a  typical  hologram  are  approxi¬ 
mated  by  a  discrete  sequence  of  physically  and  mathemati¬ 
cally  simplified  elements:  infinitesimally  thin  phase  and/or 
polarization modulation layers, interleaved with optically
homogeneous layers of finite thickness. By approximating
the distributed grating with a sufficiently large number of
these discrete elements, the resulting numerical model of
the  grating  can  be  brought  arbitrarily  close  to  that  of  the 
desired  distributed  bulk  grating. 

The  BPM  concept  of  separating  the  modulation  process 
from  the  diffraction  process  in  turn  suggests  a  new  class  of 
optical  devices:  the  stratified  volume  holographic  optical 
element, or SVHOE3,4 (see figure). The SVHOE device
structure consists of a sequence of thin photosensitive holographic
recording layers that perform the optical modulation
function, interleaved with optically passive buffer
layers, i.e., layers that impress no modulation on the light
beam but rather allow the diffraction processes necessary
for thick grating response to occur. A grating recorded in
any individual modulation layer would necessarily exhibit
Raman-Nath characteristics because each recording layer
is quite thin. But for a given angular interval, the incorporation
of a sufficiently large number of thin gratings leads
to a structure with an angular alignment sensitivity that is
indistinguishable from that of the bulk grating. Thus, an
SVHOE structure can closely emulate the Bragg response
of a bulk grating, using surprisingly few thin grating layers.

In  addition,  this  structure  exhibits  features  not  found  in 
bulk  gratings  that  may  prove  useful  for  certain  system  ap¬ 
plications.  For  example,  illumination  of  an  SVHOE  struc¬ 
ture  with  focused  monochromatic  light  produces  uniform¬ 
ly  spaced  angular  response  peaks,  which  can  be  trans¬ 
formed  with  a  lens  to  produce  an  array  of  equally  spaced 
points. This function is useful for interconnection of optical
cellular logic arrays. For example, illumination with
850 nm light of a two grating SVHOE with a grating separation
of 1 cm and a grating spatial frequency of 350 cycles/mm
at F/3.3 will produce in excess of 1000 regularly
spaced diffracted beams. The occurrence of multiple (periodic)
peaks in the angular response profile can be applied
to angular encoding applications, in which a two grating
SVHOE  mounted  on  the  part  to  be  tracked  can  be  probed 
by  a  single  beam  to  allow  coarse  angular  measurement  by 
peak  counting,  and  fine  angular  measurement  by  interpo¬ 
lation.  Highly  selective  wavelength  notch  filtering  can  also 
be  accomplished  in  SVHOE  structures  by  appropriate 
choice  of  buffer  layer  thickness  and  grating  spatial  fre¬ 
quency.5 
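
The figure of more than 1000 diffracted beams can be made plausible with a rough estimate. The sketch below rests on several assumptions that are not stated in the text: small angles, a buffer layer of unit refractive index, and an angular peak spacing of $\Lambda/d$ (grating period divided by layer separation), which follows from requiring the relative phase $K_G d\,\Delta\theta$ between the two layers' diffracted contributions to change by $2\pi$. At this level of approximation the wavelength drops out. Names are illustrative.

```python
import math

GRATING_FREQ_CYCLES_PER_MM = 350.0   # grating spatial frequency from the text
LAYER_SEPARATION_MM = 10.0           # 1 cm separation between the two gratings
F_NUMBER = 3.3                       # focusing condition from the text

# Grating period in mm.
grating_period_mm = 1.0 / GRATING_FREQ_CYCLES_PER_MM

# ASSUMPTION: peak spacing of (period / separation) radians, small angles,
# buffer layer of unit refractive index.
peak_spacing_rad = grating_period_mm / LAYER_SEPARATION_MM

# Full cone angle subtended by an F/3.3 beam.
cone_full_angle_rad = 2.0 * math.atan(1.0 / (2.0 * F_NUMBER))

n_peaks = cone_full_angle_rad / peak_spacing_rad

print(f"peak spacing: {peak_spacing_rad * 1e3:.3f} mrad")
print(f"cone angle:   {cone_full_angle_rad:.3f} rad")
print(f"peaks in cone: {n_peaks:.0f}")
```

Under these assumptions the cone contains slightly more than 1000 response peaks, consistent with the quoted figure; a buffer of higher refractive index would widen the external peak spacing and reduce the count.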

Further,  for  real-time  material  implementations  of 
SVHOE structures in which the photoresponse for grating
formation can be either optically or electrically switched
on  a  layer-by-layer  basis,  a  structure  with  a  given  number 
of  layers  can  interconnect  either  every  element  or  every 
pth  element  with  the  source,  in  a  programmable  manner. 
For example, such layer-by-layer programmability may
prove feasible by either optical bleaching or electrical tuning
of the exciton resonance in a multiple quantum well
structure, or by modulation of a doping superlattice.
SVHOE structures can also be conceived that exhibit entirely
different diffraction behavior in response to grating
formation by distinct writing wavelengths, by actively altering
the layer photosensitivity spectrum on a layer-by-layer
basis.

The SVHOE structure, showing the recording of a grating
with the indicated wave vector by means of two coherent,
collimated recording beams.

These  novel  features  of  SVHOEs  collectively  allow  for 
numerous  additional  degrees  of  freedom  in  the  formula¬ 
tion  of  both  passive  and  active  holographic  elements. 

16 Gb/s LIGHTWAVE TRANSMISSION
BY OPTICAL TIME-DIVISION
MULTIPLEXING

Rodney S. Tucker, Gadi Eisenstein, and Steven K.
Korotky

AT&T  Bell  Laboratories 
Crawford  Hill  Laboratory 
Holmdel,  N.J. 

Optical  data  at  a  bit  rate  of  16  Gb/s  were  recently 
transmitted  over  optical  fiber  in  an  experiment  at 
AT&T  Bell  Laboratories.  This  is  the  highest  transmission 
bit-rate reported for any single-channel lightwave system,
an achievement made possible using optical technology
to time-division multiplex and demultiplex the data.

Commercial  lightwave  transmission  systems  use  elec¬ 
tronic  time-division  multiplexing  (TDM)  to  combine  a 
number  of  electronic  data  channels  into  a  single  higher 
capacity  channel  for  transmission.  In  essence,  time-divi¬ 
sion  multiplexing  is  a  digital  technique  in  which  data  bits 
are interleaved in time. As bit-rates move into the multigigabit
per second range, the speed of available electronics
is being pushed to the limit and the electronic multiplexing
circuitry causes a speed bottleneck. The optical time-divi¬
sion  multiplexing  techniques  used  in  the  Bell  Laboratories 
experiments  retain  the  digital  format  of  the  signal,  but 
overcome  the  speed  bottleneck.  This  is  achieved  by  capi¬ 
talizing  on  the  inherent  wide-band  capabilities  of  optical 
components such as pulsed semiconductor lasers and lithium
niobate (LiNbO3) directional coupler switches.

In  the  16  Gb/s  optical  time-division  multiplexed  system, 
four optical data streams at 4 Gb/s are time-multiplexed
directly in the optical domain to give a 16 Gb/s data
stream for transmission. At the receiver end of the system,
the 16 Gb/s data stream is demultiplexed optically to four
lower bit-rate optical streams before detection and conver¬
sion  to  the  electrical  domain. 

The  4  Gb/s  input  data  streams  originate  from  short  op¬ 
tical  pulses  generated  by  four  mode-locked  semiconductor 
lasers. Data are encoded on the pulses using LiNbO3
waveguide optical modulators. Optical multiplexing is
carried out by simply delaying the optical pulse streams
and  combining  them  in  a  passive  fiber  directional  coupler 
array. 
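
The bit interleaving described above can be illustrated with a purely logical model. The sketch below is not a description of the optical hardware: it treats each 4 Gb/s stream as a list of bits, interleaves four such lists into one 16 Gb/s sequence, and recovers them again. The function names are hypothetical, and the delay values are simply multiples of the 16 Gb/s bit slot.

```python
from itertools import chain


def otdm_multiplex(channels):
    """Bit-interleave equal-length channel streams into one line-rate stream."""
    if len({len(c) for c in channels}) != 1:
        raise ValueError("all channels must carry the same number of bits")
    return list(chain.from_iterable(zip(*channels)))


def otdm_demultiplex(stream, n_channels):
    """Recover the channel streams by taking every n-th bit slot."""
    if len(stream) % n_channels != 0:
        raise ValueError("stream length must be a multiple of the channel count")
    return [list(stream[k::n_channels]) for k in range(n_channels)]


CHANNEL_RATE_BPS = 4e9
N_CHANNELS = 4

line_rate_bps = CHANNEL_RATE_BPS * N_CHANNELS
bit_slot_ps = 1e12 / line_rate_bps
# Delay applied to channel k so that its pulses fall in bit slot k.
delays_ps = [k * bit_slot_ps for k in range(N_CHANNELS)]

channels = [[1, 0, 1], [0, 0, 1], [1, 1, 0], [0, 1, 0]]
line_stream = otdm_multiplex(channels)
recovered = otdm_demultiplex(line_stream, N_CHANNELS)

print(f"line rate: {line_rate_bps / 1e9:.0f} Gb/s, bit slot: {bit_slot_ps} ps")
print(f"delays (ps): {delays_ps}")
print(f"line stream: {line_stream}")
```

In this model each channel is offset by one 62.5 ps bit slot relative to its neighbor, which plays the role of the optical delays applied before the passive coupler array.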

The  demultiplexer,  the  most  important  component  in 
the system, separates the pulses on the incoming 16 Gb/s
stream into four streams each at 4 Gb/s. A block diagram
of the demultiplexer and its associated electronics is


OPTICS NEWS ■ DECEMBER 1988


APPENDIX  6 


Reprint  from 

Topics  in  Applied  Physics,  Volume  62: 

Photorefractive  Materials  and  Their  Applications  II 

Survey  of  Applications 

Editors: P. Günter and J.-P. Huignard


© Springer-Verlag Berlin Heidelberg 1989
Printed in Germany. Not for Sale.

Reprint only allowed with permission from Springer-Verlag


Springer-Verlag

Berlin Heidelberg New York
London Paris Tokyo


Photorefractive Materials and Their Applications II

Survey of Applications

Editors: P. Günter and J.-P. Huignard

1. Introduction. By J.-P. Huignard and P. Günter

2. Amplification, Oscillation, and Light-Induced Scattering in
Photorefractive Crystals. By S.G. Odulov and M.S. Soskin
(With 25 Figures)

3. Photorefractive Effects in Waveguides. By V.E. Wood, P.J. Cressman,
R.L. Holman, and C.M. Verber (With 15 Figures)

4. Wave Propagation in Photorefractive Media. By J.O. White,
Sze-Keung Kwong, M. Cronin-Golomb, B. Fischer, and A. Yariv
(With 34 Figures)

5. Phase-Conjugate Mirrors and Resonators with Photorefractive
Materials. By J. Feinberg and K.R. MacDonald (With 35 Figures)

6. Optical Processing Using Wave Mixing in Photorefractive Crystals.
By J.-P. Huignard and P. Günter (With 56 Figures)

7. The Photorefractive Incoherent-To-Coherent Optical Converter.
By J.W. Yu, D. Psaltis, A. Marrakchi, A.R. Tanguay, Jr.,
and R.V. Johnson (With 32 Figures)

8. Photorefractive Crystals in PRIZ Spatial Light Modulators.
By M.P. Petrov and A.V. Khomenko (With 12 Figures)


7.  The  Photorefractive  Incoherent-To-Coherent 
Optical  Converter 

Jeffrey W. Yu, Demetri Psaltis, Abdellatif Marrakchi, Armand R. Tanguay, Jr.,
and  Richard  V.  Johnson 


With  32  Figures 


7.1  Overview 

High  performance  spatial  light  modulators  (SLMs)  are  essential  in  many  optical 
information  processing  and  computing  applications  for  converting  incoherent 
images to coherent replicas suitable for subsequent processing [7.1, 2]. A typical
spatial  light  modulator  consists  of  a  photosensitive  element  to  capture  the  inco¬ 
herent  light  image  and  an  optical  modulator  element  to  impress  the  incoherent 
input  image  content  onto  a  coherent  readout  beam.  A  particularly  important 
class  of  spatial  light  modulators  employs  photorefractive  crystals  that  combine 
both  photosensitive  and  electrooptic  modulation  functions  within  the  same 
medium.  Examples  of  electrooptic  spatial  light  modulators  that  utilize  photore¬ 
fractive  crystals  include  the  PockeLs  Readout  Optical  Modulator  ( PROM)  [7.3] 
and  the  PRIZ  (a  Soviet  acronym  for  a  crystallography  cally  modified  PROM) 
[74]. 

During  operation  of  the  Pockels  Readout  Optical  Modulator,  the  input 
image-bearing  beam  creates  photoinduced  carriers  which  are  longitudinally 
separated  by  an  applied  bias  electric  field.  This  charge  separation  produces  a 
space-variant division of the applied field across the active electrooptic crystal
and  one  or  more  dielectric  blocking  layers,  as  shown  in  the  upper  left  quadrant 
of  Fig.  7.1.  The  local  birefringence  of  the  medium  depends  on  the  longitudinal 
component of the local electric field, and hence can be sensed by a polarized
readout beam observed through an exit analyzer. In the PRIZ, the same
charge  generation  and  separation  process  is  utilized,  with  the  difference  that 
the  crystallographic  orientation  is  chosen  to  emphasize  readout  sensitivity  to 
the  transverse  components  of  the  induced  electric  field  distribution,  as  shown 
in  the  upper  right  quadrant  of  Fig.  7.1. 

The PROM and the PRIZ are typically limited in spatial frequency response
to of order 10 cycles/mm at optimum optical exposure [7.5], and hence
are limited to relatively modest bandwidth optical processing and computing
applications. The physical configurations of the PROM and PRIZ devices
do not lend themselves readily to exploitation of the remarkably high spatial
bandwidths available in holographic configurations, in which the input image
is encoded on a spatial carrier (as shown in the lower left quadrant of Fig. 7.1
and discussed extensively elsewhere in this book). Even when utilizing the
same electrooptic crystal as in the PROM and PRIZ (typically bismuth silicon
oxide), holographic recording configurations employing transverse applied
electric fields and no blocking layers have been shown to exhibit spatial frequency
bandwidths in excess of 2000 cycles/mm. However, such purely holographic
recording requires input of image-dependent information on one of two coherent
input beams, and as such cannot be directly utilized for performing the
incoherent-to-coherent conversion function.


ELECTROOPTIC SPATIAL LIGHT MODULATORS
Fig. 7.1. Schematic diagram of the principal components of the applied voltage, charge
transport, sensed electric field components, and input light wave vectors for four types of
spatial light modulators that utilize single crystal bismuth silicon oxide (Bi12SiO20). The
acronym VHOE stands for volume holographic optical element; likewise PROM stands for
the Pockels Readout Optical Modulator, PRIZ is a Soviet acronym for a crystallographically
modified PROM, and PICOC stands for the photorefractive incoherent-to-coherent optical
converter, the subject of this chapter

A fourth distinct type of photorefractive spatial light modulator has been
independently proposed by Kamshilin and Petrov [7.6] and by the present
authors [7.7,8] which combines the incoherent-to-coherent conversion function
with an essentially holographic recording process, and thereby exhibits several
advantages of each. As in the holographic recording case, a transverse applied
electric field is used in conjunction with two uniform coherent writing beams
to produce a volume grating that is then selectively modified by a third beam
encoded with the information content to be stored or converted. This
configuration is shown schematically in the lower right quadrant of Fig. 7.1. The
Photorefractive Incoherent-to-Coherent Optical Converter (PICOC) [7.7,8] device
is capable of recyclable real time operation, is characterized by an enhanced
spatial frequency response, and is even simpler to construct than the PROM
or PRIZ. In addition, the PICOC device configuration allows its use in many
quasi-holographic techniques (such as optical phase conjugation), which in turn
lead to potentially novel optical information processing and computing
architectures [7.9-11].
The photorefractive incoherent-to-coherent optical conversion process is
described in Sect. 7.2, as are alternative sequencing schemes and optical
implementations. In Sect. 7.3, the limiting assumptions needed to derive a tractable
analytical model of PICOC performance are specified, in preparation for
detailed discussion of the recording stage in Sect. 7.4, and of the readout stage in
Sect. 7.5. Conclusions and future research directions are offered in Sect. 7.6.


7.2  Physical  Principles  and  Modes  of  Operation 

The  photorefractive  incoherent-to-coherent  optical  conversion  (PICOC)  process 
is  perhaps  best  understood  as  an  extension  of  the  more  familiar  holographic 
recording process in a photorefractive medium. The physical principles governing
such recording are briefly reviewed in this section, and in much greater
detail  in  Sect.  7.4.  The  extension  of  this  recording  process  to  include  PICOC 
allows  for  at  least  three  different  temporal  modes  for  sequencing  the  coherent 
grating  with  respect  to  the  incoherent  image.  These  modes  are  identified  and 
compared in this section. In addition, two alternative optical architectures are
defined,  and  converted  images  generated  by  one  representative  configuration 
are  presented. 

The high sensitivity of photoconductive and electrooptic crystals such as
bismuth silicon oxide (Bi12SiO20, or BSO) in the visible portion of the
spectrum has allowed the simultaneous recording and reading of volume holograms
to be achieved with time constants amenable to real-time operation [7.12]. The
holographic recording process in photorefractive materials involves
photoexcitation, charge transport, and trapping mechanisms [7.13]. When two coherent
beams are allowed to interfere within the volume of such a crystal, free carriers
are nonuniformly generated by absorption and are redistributed by diffusion
and/or drift under the influence of an externally applied electric field.
Subsequent trapping of these charges at relatively immobile trapping sites generates a
stored space-charge field, which in turn modulates the refractive index through
the linear electrooptic (Pockels) effect and thus records a volume phase
hologram. If both coherent writing beams are plane waves, the induced hologram
consists of a uniform grating.

In the photorefractive incoherent-to-coherent optical conversion (PICOC)
process, an incoherent image is focused in the volume of the photorefractive
material in addition to the coherent grating beams, creating an additional
spatial modulation of the charge distribution stored in the crystal. This spatial
modulation can be transferred onto a coherent readout beam by reconstructing
the holographic grating. The spatial modulation of the coherent reconstructed
beam will then be a negative replica of the input incoherent image, as shown
in Fig. 7.2. It should be noted here that a related image encoding process can
be implemented nonholographically by premultiplication of the image with a
grating [7.14].
Fig. 7.2. Principle of operation of the photorefractive incoherent-to-coherent optical
converter (hereinafter referred to as PICOC)


Fig. 7.3. PICOC in the grating erasure mode (GEM), in which the carrier grating is recorded
before the incoherent image-bearing signal
In PICOC, the holographic grating can be recorded before, during, or
after the crystal is exposed to the incoherent image. Therefore, several distinct
operating modes are possible. These include the grating erasure mode (GEM;
Fig. 7.3), the grating inhibition mode (GIM; Fig. 7.4), and the simultaneous
erasure/writing mode (SEWM; Fig. 7.5).

In  the  grating  erasure  mode  (GEM),  shown  schematically  in  Fig.  7.3,  a 
uniform  grating  is  first  recorded  by  interfering  two  coherent  writing  beams  in 
the  photorefractive  crystal.  The  writing  beams  are  turned  off,  and  this  grating  is 
then selectively erased by incoherent illumination of the crystal with an
image-bearing beam. The incoherent image may be incident either on the same face of
the crystal as the writing beams, or on the opposite face. When the absorption
coefficients at the writing and image-bearing beam wavelengths give rise to
significant depth nonuniformity within the crystal, these two cases will have
distinct  wavelength-matching  conditions  for  response  optimization  [7.8]. 

In  the  grating  inhibition  mode  (GIM),  shown  schematically  in  Fig.  7.4, 
the crystal is pre-illuminated with the incoherent image-bearing beam prior
to  grating  formation.  This  serves  to  selectively  decay  (enhance)  the  applied 
transverse  electric  field  in  exposed  (unexposed)  regions  of  the  crystal.  After 
this  pre-exposure,  the  coherent  writing  beams  are  then  allowed  to  interfere 
within  the  crystal,  causing  grating  formation  with  spatially  varying  efficiency 
due  to  significant  differences  in  the  local  effective  applied  field. 

In the simultaneous erasure/writing mode (SEWM), shown schematically
in Fig. 7.5, the incoherent image modulation, the coherent grating formation
process, and the readout function are performed simultaneously. A stable image
is transcribed after the space-charge field has reached steady state.


Fig. 7.4. PICOC in the grating inhibition mode (GIM), in which the carrier grating is
recorded after the incoherent image-bearing signal

Fig. 7.5. PICOC in the simultaneous erasure/writing mode (SEWM), in which the carrier
grating and the incoherent image-bearing signal are recorded simultaneously

A  further  distinction  can  be  made  in  the  operating  modes  between  strictly 
cyclic exposure/readout, with an upper limit on the readout time interval, and
operation  in  which  prolonged  readout  times  are  required.  Cyclic  readout  can  be 
achieved  by  any  one  of  the  three  sequencing  modes  introduced  above,  and  the 
sensitometry  requirement  for  achieving  good  quality  images  can  be  expressed  in 
terms  of  optical  exposure  (i.e.,  optical  energy  per  unit  area).  When  prolonged 
readout  is  required,  degradation  of  the  stored  space-charge  profile  can  occur, 
due  either  to  dark  current  or  erasure  induced  by  the  optical  readout  beam.  The 
most  appropriate  sequencing  mode  for  prolonged  readout  is  the  simultaneous 
erasure/writing  mode  (SEWM)  because  it  constantly  regenerates  the  space- 
charge  field  profile.  However,  the  sensitometry  requirement  for  this  latter  mode 
is better expressed in terms of optical power rather than optical exposure, assuming
readout time intervals long compared with the time required to achieve
a  stable  steady  state  readout  image.  In  exchange  for  this  optical  energy  penalty, 
SEWM  offers  a  considerably  simplified  experimental  configuration  with  no  need 
for  temporal  sequencing,  a  much  greater  tolerance  for  photorefractive  crystals 
with  increased  dark  conductivity,  and  readout  of  essentially  unlimited  duration. 
Readout light beams of much shorter wavelength and/or much higher intensities
can be accommodated in SEWM without incurring unacceptable erasure
of the charge pattern. Because of its experimental convenience and analytical
simplicity, SEWM is emphasized in this chapter, with more detailed discussion
of GEM and GIM given later in Sect. 7.4.4.

Optical Implementations. The original implementation of PICOC described by
Kamshilin and Petrov [7.6] is a modification of the nondegenerate four-wave
mixing geometry to include simultaneous exposure by an incoherent
image-bearing beam. This configuration requires a readout wavelength separate and
distinct from the coherent grating writing wavelength, which then allows the
readout wavelength to be selected for significantly reduced grating erasure
rates. Thus, the grating inhibition mode (GIM) and the grating erasure mode
(GEM) are best implemented in this configuration. However, the optical
alignment is more intricate with this architecture than it is with the degenerate
four-wave mixing geometry introduced next, since the Bragg angle of the
readout beam will not be the same as the Bragg angle of the coherent writing beams
(shown in Fig. 7.6).

An  alternative  optical  implementation  is  a  modification  of  the  conventional 
degenerate  four-wave  mixing  geometry  to  include  simultaneous  exposure  by  an 
incoherent  image-bearing  beam,  as  shown  in  Fig.  7.7.  This  implementation  has 
the  advantage  of  extremely  easy  optical  alignment,  as  the  readout  beam  is 
readily  Bragg  aligned  by  retroreflecting  one  of  the  two  coherent  grating  writing 
beams. It has the disadvantage that the readout beam, being at the same
wavelength as the coherent grating writing beams, must erase the grating structure
being probed at rates comparable to the writing process, assuming a readout
light intensity comparable to the writing light intensities. Thus this
implementation can be utilized for the simultaneous erasure/writing mode (SEWM), and
can be adapted for the grating erasure mode (GEM) and the grating inhibition
mode  (GIM)  only  by  significantly  reducing  the  probe  beam  intensity,  with  a 
correspondingly  reduced  readout  signal  intensity. 

Fig. 7.6. Nondegenerate four-wave mixing architecture to perform the photorefractive
incoherent-to-coherent optical conversion

Fig. 7.7. Degenerate four-wave mixing architecture to perform the photorefractive
incoherent-to-coherent optical conversion


Fig. 7.8. Bismuth silicon oxide crystal orientation for the PICOC transverse electrooptic
configuration and recording geometry. The volume holographic grating with wave vector $K_G$
is formed by the coherent writing beams $I_1$ and $I_2$, and the incoherent image information is
encoded on beam $I_S$

As a specific example of the PICOC process, consider a degenerate four-wave
mixing configuration in which the coherent writing beams and the incoherent
image-bearing beam were made to illuminate the same face of a 1.3 mm
thick crystal of bismuth silicon oxide, obtained from Crystal Technology, Inc.
An electric field of 4 kV/cm was applied along the (110) axis, as shown in
Fig. 7.8. A 300 cycles/mm grating was written by the 515 nm line of an argon
ion laser with the grating wave vector oriented parallel to the applied bias field
to maximize the diffraction efficiency. The image-bearing light source was either
a xenon arc lamp, a tungsten lamp, or the 488 nm line of the argon ion laser.
The average coherent grating intensity was 0.4 mW/cm² and the image-bearing
light intensity was typically 8.0 mW/cm². The coherent grating writing beams
were polarized orthogonal to the applied electric field. A polarizer was inserted
at the output to minimize coherent optical scatter from the crystal [7.15].
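
The recording geometry quoted above can be checked with a few lines of arithmetic. The following is a minimal sketch, not part of the original analysis: it converts the 300 cycles/mm carrier into a grating period, estimates the external Bragg half-angle at 515 nm from sin(theta) = lambda / (2 * period), and evaluates the Klein-Cook parameter Q = 2*pi*lambda*d / (n * period^2) as a rough indicator of whether the 1.3 mm crystal behaves as a thick grating. The function name and the refractive index value of 2.6 are assumptions introduced for illustration; the index is a nominal figure for bismuth silicon oxide in the green and is not taken from this chapter.

```python
import math


def grating_geometry(freq_cycles_per_mm, wavelength_nm, thickness_mm, refractive_index):
    """Return (period in micrometers, external Bragg half-angle in degrees, Klein-Cook Q)."""
    period_m = 1e-3 / freq_cycles_per_mm
    wavelength_m = wavelength_nm * 1e-9
    # Bragg condition in air for a symmetric transmission grating.
    theta = math.asin(wavelength_m / (2.0 * period_m))
    # Klein-Cook parameter; values much larger than about 10 indicate a thick grating.
    q = (2.0 * math.pi * wavelength_m * thickness_mm * 1e-3
         / (refractive_index * period_m ** 2))
    return period_m * 1e6, math.degrees(theta), q


ASSUMED_BSO_INDEX = 2.6  # assumption: nominal value, not quoted in the text

period_um, bragg_deg, q = grating_geometry(300.0, 515.0, 1.3, ASSUMED_BSO_INDEX)
print(f"grating period      : {period_um:.2f} um")
print(f"Bragg half-angle    : {bragg_deg:.2f} deg (in air)")
print(f"Klein-Cook parameter: {q:.0f}")
```

Under these assumptions the period is a few micrometers, the writing beams cross at a half-angle of a few degrees, and Q is well above 10.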




Sample converted images obtained from two binary transparencies (a spoke
target and a U.S. Air Force resolution target) and from two black-and-white
slides with continuous tone gray scale are shown in Fig. 7.9. The original
transparency and its converted image have reversed contrast, as shown in this figure
and as explained by Fig. 7.2. An approximate resolution of 15 line pairs/mm
(determined from the resolution target image) was achieved without optimizing
factors such as the Bragg readout condition. Such optimization results in
striking enhancements of the resolution to of order 50 line pairs/mm, as discussed in
Sect. 7.5 below. Similar images of comparable quality have also been recorded
in a bismuth silicon oxide crystal in which the (001) axis is aligned parallel to
the coherent grating wave vector and to the applied bias electric field.

Having reviewed in broadest terms the physical principles and modes of
operation of the PICOC process, let us now delineate the scope and limitations
of the analytical model, as given in the next section, in preparation for more
detailed studies of the recording process in Sect. 7.4 and the readout process in
Sect. 7.5.


Fig. 7.9a-d. Examples of the conversion of binary and gray-level transparencies: (a) spoke
target, (b) U.S. Air Force resolution target, (c) airplane, and (d) an incoherent student


7.3  Delineation  of  the  Analytical  Model 

A  reasonably  accurate  and  complete  model  of  the  photorefractive  incoherent-to- 
coherent  optical  conversion  process  is  based  upon  a  set  of  equations  that  govern 
trap  and  electron  balance,  electron  transport,  and  the  buildup  of  a  space-charge 
field,  as  detailed  in  Sect.  7.4.1.  This  model  exhibits  both  nonlocal  response 
due to charge transport and striking nonlinearities. No analytical solution has
yet  been  identified  that  is  broadly  applicable  to  the  full  range  of  important 
experiments  (e.g.,  experiments  involving  large  modulation  depths  and  photo- 
induced  variations  in  the  recombination  time).  Numerical  solutions  are  certainly 
feasible,  but  have  not  yet  been  fully  explored.  To  help  refine  physical  insight  into 
the  conversion  process,  an  approximate  model  capable  of  analytic  solution  needs 
to  be  defined,  but  its  interpretation  must  be  tempered  with  careful  attention 
to  its  limitations. 

Two such approximate solution models are identified herein. The first
approach, the “constant recombination time approximation,” is based upon the
analytical studies of Moharam et al. [7.16], and was presented previously by
the authors [7.8]. This approach is discussed in sufficient detail in Sect. 7.3.1
to enable comparison with the second approach, the “perturbation series
approximation,” as introduced in Sect. 7.3.2. These two approaches are compared
and contrasted in this section. The perturbation series approximation is then
used as the basis for continued discussion of PICOC performance
characteristics throughout the remainder of the chapter, and is detailed more fully in
Sect. 7.4.1.

7.3.1  Constant  Recombination  Time  Approximation 

One such approximate model of the PICOC process [7.8] evolves from analytic
solutions of single spatial frequency grating recording as derived by Moharam
et al. [7.16]. This model approximates quite well the nonlinearity of the
conversion process, but it is limited in scope to steady state behavior. Hence the
analytical solutions of this model (derived in [7.8]) are suitable only for
studying the simultaneous erasure/writing mode (SEWM) in the steady state regime,
and cannot cope with the grating erasure mode (GEM) and the grating
inhibition mode (GIM) response. A further limitation of this approximation is its
assumption of a constant recombination time (which is equivalent to assuming
an infinite trap-limited saturation field strength), and so this will hereinafter be
referred to as the “constant recombination time” model. A more accurate study
of the photorefractive effect, as defined by a set of photoconductivity equations,
allows for photoinduced variations of the recombination time from its nominal
value. Although these variations may be quite small in amplitude compared
with the nominal value, they nevertheless provide a very important
mechanism that can significantly modify the space-charge field for certain recording
configurations.

A key assumption of the constant recombination time model is the absence
of self-diffraction. Self-diffraction is the process by which the grating written at
the entrance of the photorefractive crystal diffracts a portion of the coherent
writing beams, modifying the interference pattern and hence the grating that
is recorded deeper in the crystal [7.13, 17, 37]. This process can be neglected for
sufficiently thin crystals and low diffraction efficiencies (e.g., 2 mm of bismuth
silicon oxide) and allows a considerable simplification of the analysis. If
alternative crystals with significantly higher electrooptic coefficients and/or thicker
crystals are used, then the diffraction efficiency would increase, self-diffraction
effects would be far more pronounced, and thus the mathematical framework
described in this chapter would need to be modified.

7.3.2  Perturbation  Series  Approximation 

By  recasting  the  chosen  set  of  photorefractive  equations  in  Fourier  transform 
space,  additional  physical  insights  into  the  transcription  process  can  be  achieved 
which  lead  naturally  to  a  perturbation  series  formulation  of  the  conversion 
process.  Analytic  solutions  can  be  derived  for  the  first  few  terms  of  this  series 
without  restricting  such  solutions  to  the  steady  state  regime,  and  so  define  an 
additional  approximate  model  of  the  PICOC  process.  Limiting  the  analysis  to 
the  first  few  terms  means  that  the  strong  nonlinearities  that  characterize  the 
incoherent-to-coherent  optical  conversion  process  are  estimated  at  best,  but  the 
most  compelling  advantage  of  this  approach  is  its  ability  to  model  the  temporal 
evolution  of  the  space-charge  field.  Study  of  the  temporal  behavior  is  crucial 
to  the  analysis  of  the  grating  erasure  mode  (GEM)  and  the  grating  inhibition 
mode  (GIM).  In  this  chapter,  we  use  a  perturbation  model  that  predicts  the 
various  spatial  frequency  components  of  the  space-charge  field. 

Consider for example a coherent grating beam $I_G(x)$ of the form

$$ I_G(x) = I_0 \left(1 + m_G \cos K_G x\right) , \tag{7.1} $$

in which $m_G$ is the modulation depth and $K_G$ is the wave vector associated
with the coherent grating, and an incoherent signal beam $I_S(x)$ of similar form

$$ I_S(x) = I_1 \left(1 + m_S \cos K_S x\right) , \tag{7.2} $$

in which $m_S$ is the modulation depth and $K_S$ is the wave vector associated with
the image profile. The coordinate system is defined such that x is parallel to the
applied bias electric field $E_0$, which is also parallel to the coherent grating wave
vector $K_G$; z is orthogonal to the entrance face of the photorefractive crystal;
y is defined to complete a right-handed coordinate system. Because of the
nonlinearities in the recording process, the space-charge field $E(x)$ transcribed
by these light beams contains spatial frequencies that do not exist in the original
light profiles. These intermodulation terms have the form

$$ E(x) = \sum_{m=-\infty}^{+\infty} \sum_{n=-\infty}^{+\infty} E_{mn} \exp\left[ i \left( m K_G + n K_S \right) x \right] . \tag{7.3} $$

If the recording process were perfectly linear, then only terms such as $E_{10}$ (i.e.,
m = 1 and n = 0) and $E_{01}$ (m = 0 and n = 1) would appear in which one
of the two subscripts is zero, assuming light profiles as defined by (7.1,2). The
nonlinearity of the recording process induces new spatial harmonics such as
$E_{11}$ which describe the modulation of the coherent grating by the incoherent
image beam and hence are central to the photorefractive incoherent-to-coherent
optical conversion process.
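
The distinction between a linear and a nonlinear recording response can be illustrated numerically. The sketch below is only a toy model and is not the photorefractive model of this chapter: it assumes, purely for illustration, a local response in which the field is either linear in the light profile or proportional to its reciprocal, and it evaluates the coefficients $E_{mn}$ of (7.3) by treating the two phases $K_G x$ and $K_S x$ as independent variables (appropriate when the two spatial frequencies are incommensurate). The function names, the reciprocal response, and the modulation depths of 0.1 are all assumptions.

```python
import cmath
import math

M_G = 0.1  # assumed grating modulation depth (illustrative)
M_S = 0.1  # assumed image modulation depth (illustrative)


def light(phi_g, phi_s):
    """Normalized light profile 1 + m_G cos(K_G x) + m_S cos(K_S x)."""
    return 1.0 + M_G * math.cos(phi_g) + M_S * math.cos(phi_s)


def linear_response(phi_g, phi_s):
    """Toy linear response: field falls in proportion to the local light excess."""
    return 1.0 - (light(phi_g, phi_s) - 1.0)


def reciprocal_response(phi_g, phi_s):
    """Toy nonlinear response: field inversely proportional to the local light."""
    return 1.0 / light(phi_g, phi_s)


def fourier_coefficient(response, m, n, samples=64):
    """Coefficient E_mn of exp[i(m*phi_g + n*phi_s)], in units of the bias field."""
    total = 0j
    for j in range(samples):
        phi_g = 2.0 * math.pi * j / samples
        for k in range(samples):
            phi_s = 2.0 * math.pi * k / samples
            total += response(phi_g, phi_s) * cmath.exp(-1j * (m * phi_g + n * phi_s))
    return total / samples ** 2


for name, response in (("linear", linear_response), ("reciprocal", reciprocal_response)):
    e10 = fourier_coefficient(response, 1, 0).real
    e11 = fourier_coefficient(response, 1, 1).real
    print(f"{name:10s}  E_10 = {e10:+.5f}   E_11 = {e11:+.5f}")
```

In this toy model the linear response produces no $E_{11}$ term, whereas the reciprocal response produces one that scales with the product of the two modulation depths.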

The intermodulation decomposition (7.3) provides a natural framework
for a perturbation series analysis of the photorefractive incoherent-to-coherent
optical conversion process in powers of the modulation depths $m_G$ and $m_S$,
a technique first demonstrated by Kukhtarev et al. as applied to the analysis
of single grating transcription [7.18,19], and extended by Ochoa et al. [7.20]
and by Refregier et al. [7.21]. In practice, a typical image consists of a
multiplicity of spatial frequency components, not just the single frequency signal
term postulated in (7.2). Such a spectrally rich image will necessarily introduce
an additional summation over the spectral harmonics in (7.3). The resultant
series, if limited to terms of the form $E_{11}$, predicts a linear transcription of the
image profile for which the response to each spatial frequency component can
be evaluated separately, such that the total response is determined by
summing up all harmonics. Higher order terms, such as $E_{12}$, describe nonlinear
distortion of the image spectrum in which new image frequencies are generated
that do not exist in the original incoherent image profile. These higher order
terms prove to be extremely tedious to calculate. If these higher order terms
contribute significantly to the conversion process, then alternative methods of
analysis such as numerical modeling are recommended. The analysis presented
in this chapter concentrates on the $E_{11}$ term.

Analytical expressions defining the temporal evolution of the first few
harmonic terms of (7.3) are readily derived. As shown in Appendix 7.A and in
[7.22], specific perturbation terms for the simultaneous erasure/writing mode
(SEWM) in the saturation limit can be approximated by

$$ E_{01} = -\tfrac{1}{2}\, m_S^{\mathrm{eff}} E_0 , \tag{7.4} $$

$$ E_{10} = -\tfrac{1}{2}\, m_G^{\mathrm{eff}} E_0 , \quad \text{and} \tag{7.5} $$

$$ E_{11} = \tfrac{1}{2}\, m_G^{\mathrm{eff}} m_S^{\mathrm{eff}} E_0 , \tag{7.6} $$

in which $E_0$ is the applied bias field shown in Fig. 7.8, and $m_G^{\mathrm{eff}}$ and $m_S^{\mathrm{eff}}$ are
effective modulation depths to be defined in the next section.
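
As a quick numerical illustration of (7.4)-(7.6), the sketch below evaluates the three lowest harmonic amplitudes for a given bias field. It assumes that the numerical prefactor in each expression is 1/2, with negative signs on $E_{01}$ and $E_{10}$ and a positive sign on $E_{11}$; the effective modulation depths are simply passed in as numbers, since their definition is deferred to the next section. The function name and the sample values of 0.2 and 0.1 are hypothetical, while the 4 kV/cm bias field is the value quoted in the experimental example of Sect. 7.2.

```python
def sewm_saturation_terms(m_g_eff, m_s_eff, e0):
    """Lowest-order SEWM harmonics in the saturation limit, same units as e0.

    Assumes prefactors of 1/2 in (7.4)-(7.6); treat this as an assumption.
    """
    return {
        "E01": -0.5 * m_s_eff * e0,           # image term, (7.4)
        "E10": -0.5 * m_g_eff * e0,           # carrier grating term, (7.5)
        "E11": 0.5 * m_g_eff * m_s_eff * e0,  # intermodulation term, (7.6)
    }


# Hypothetical effective modulation depths with the 4 kV/cm bias field.
terms = sewm_saturation_terms(m_g_eff=0.2, m_s_eff=0.1, e0=4.0)
for name, value in terms.items():
    print(f"{name} = {value:+.3f} kV/cm")

# The sideband-to-carrier ratio depends only on the image modulation depth.
print("E11 / E10 =", terms["E11"] / terms["E10"])
```

Under these assumptions the ratio $E_{11}/E_{10}$ equals $-m_S^{\mathrm{eff}}$, so the intermodulation term always opposes the carrier term in sign.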

If bismuth silicon oxide is chosen as the photorefractive medium of
interest, then the refractive index modulation $\Delta n(x)$ induced along a principal
electrooptic axis of the crystal by the linear electrooptic effect is proportional to
the induced space-charge field $E(x)$ [7.23]. Each of the spatially periodic terms
in the space-charge field (7.3) thus induces a distinct volume phase grating
through the linear electrooptic effect. Such a superposition of phase gratings
diffracts an incident collimated readout laser beam into a multiplicity of
discrete beams, as shown in Fig. 7.10.

Another fundamental insight into the PICOC process that evolves
naturally from the perturbation series approach is the existence of a one-to-one
mapping of $E_{mn}$, the spatial frequency components of the space-charge field,
into $I_{mn}$, the diffraction orders and suborders of the spatially modulated
readout light. This identification presumes weak diffraction efficiencies, as have thus
far been typical of most PICOC implementations. In the far field diffraction
limit, shown in Fig. 7.10, the spatially modulated readout light profile $I_R(x)$
decomposes into the usual set of discrete diffraction orders associated with the
coherent grating, labeled by subscript m, while each of these orders further
decomposes into subharmonics associated with the incoherent image signal,
labeled by subscript n. This mapping offers a most convenient method to test
the behavior of distinct spatial frequency components of the space-charge field
$E_{mn}$ during the conversion process.

For typical coherent grating spatial frequencies and typical crystal
thicknesses, the volume phase gratings in the photorefractive crystal are recorded
deep within the Bragg regime. Therefore the optical readout process can
respond at most to one diffraction order such as $I_{10}$ and its immediate
subharmonics such as $I_{11}$, while excluding other diffraction orders and associated
sidebands such as $I_{01}$. Thus for the illumination profiles $I_G(x)$ and $I_S(x)$ of the
form (7.1,2), and assuming selective diffraction into only the +1st diffraction
order and its immediate sidebands, a typical form of the modulated readout
light profile $I_R(x)$ would be

$$ I_R(x) = I_{10} + I_{11} \cos(K_S x) + \text{higher image harmonics} . \tag{7.7} $$

Fig. 7.10. Far-field diffraction pattern created by the conversion of an incoherent
grating into its coherent replica, showing the mapping of spatial harmonics of the
space-charge field into diffraction orders of the modulated readout light beam

Furthermore, assuming perfect Bragg alignment for the $I_{10}$ term, and assuming
image frequencies $K_S$ small enough to avoid Bragg misalignment of the
immediately adjacent subharmonics, the diffracted intensity terms $I_{10}$ and $I_{11}$ are
proportional to the squares of the corresponding space-charge field components
$E_{10}$ and $E_{11}$ for sufficiently low diffraction efficiencies. Combining (7.4-7) and
approximating for small modulation depths $m_S^{\mathrm{eff}}$ gives

$$ I_R(x) \propto \left( m_G^{\mathrm{eff}} \right)^2 \left[ 1 - 4\, m_S^{\mathrm{eff}} \cos(K_S x) \right] , \tag{7.8} $$


in which higher order terms have been neglected. Thus we can see from (7.8)
that the incoherent image has been transcribed onto the coherent readout beam
with a reversal of image contrast. The mapping of $E_{mn}$ into $I_{mn}$ also
allows spatial filtering of the image beam to change the negative image into a
positive image (by a schlieren technique), and to improve the contrast ratio
without suffering an associated reduction in image intensity.

The perturbation series approach that leads to (7.8) is most effective
whenever the higher spatial harmonics in (7.7) can be neglected in favor of the
lowest harmonics, and this applies whenever the modulation depths
$m_G^{\mathrm{eff}}$ and $m_S^{\mathrm{eff}}$ are sufficiently small (or the
spatial frequencies are sufficiently large). Such small modulation depths occur
only for very restrictive conditions which seldom match the experimental
conditions. Therefore the scope of the perturbation solution is limited, which
reflects as much a fundamental difficulty with the PICOC process as it does a
difficulty with the analytic technique. The PICOC process requires a nonlinear
response to generate the modulation of the coherent grating by the incoherent
image field, and hence strong levels of nonlinear distortion inextricably occur
with significant levels of modulated light intensity. Restricting attention to
the lowest order spatial harmonics underestimates the nonlinearities, but leads
to simple and powerful analytical expressions for the temporal response that
contain considerable physical insight into the PICOC process and are
unobtainable by any other analytical technique.

With these comments as backdrop, we can now proceed with a more detailed
study of the recording process in Sect. 7.4 and of the readout process in
Sect. 7.5.

7.4  The  Recording  Process 

During the recording process, a space-charge electric field $E(x)$ is formed in
the photorefractive crystal in response to the combined illumination by a
coherent grating light beam $I_G(x)$ and an incoherent image beam $I_S(x)$. A
physical model describing this transcription process is reviewed in
Sect. 7.4.1, and particular numerical and analytical solutions are listed for
the simultaneous erasure/writing mode (SEWM). The implications of these results
on the nonlinear transfer function of the conversion process are explored in
Sect. 7.4.2. Issues affecting the resolution of the recording process are
discussed in Sect. 7.4.3. Finally, the temporal evolution characteristics are
studied in Sect. 7.4.4.

7.4.1  Physical  Model  and  Sample  Solutions 

The model of photoconductivity that is most frequently selected to describe
holographic grating formation in photorefractive crystals assumes a single
mobile charge species (electrons) and a single trapping level [7.13], although
more intricate models involving hole transport [7.24, 25] and multiple trapping
levels [7.26] have occasionally been proposed to achieve a better fit with
particular experiments. The simple single trapping level/single mobile charge
species model which has been chosen to describe the PICOC recording process
consists of the following set of equations:

$\frac{\partial N_D^{+}}{\partial t} = [S_G I_G(x,t) + S_S I_S(x,t) + \beta](N_D - N_D^{+}) - \gamma_R N_D^{+} n$    (7.9)

$\frac{\partial n}{\partial t} = \frac{\partial N_D^{+}}{\partial t} + \frac{1}{e} \frac{\partial J}{\partial x}$    (7.10)

$\frac{\partial E}{\partial x} = \frac{e}{\varepsilon \varepsilon_0} (N_D^{+} - N_A^{-} - n)$    (7.11)

$J = e n \mu E + k T \mu \frac{\partial n}{\partial x}$    (7.12)

in which

$N_D$                    is the total concentration of donor-like trapping centers,
$N_{D\mathrm{eq}}^{+}$   is the concentration of ionized donor-like trapping centers
                         in quasi-equilibrium under dark conditions,
$N_D^{+}(x,t)$           is the concentration of ionized donor-like trapping centers,
$N_A^{-}$                is the concentration of negatively charged acceptor-like
                         centers that compensate for the charge $N_{D\mathrm{eq}}^{+}$
                         under dark thermal quasi-equilibrium conditions
                         ($N_A^{-}$ is a constant of the crystal),
$n(x,t)$                 is the concentration of electrons in the conduction band,
$E(x,t)$                 is the internal space-charge electric field,
$e$                      is a positive number with magnitude equal to the electronic
                         charge,
$S_G$                    is the cross section of photo-ionization for the coherent
                         grating beams with wavelength $\lambda_G$ divided by the
                         photon energy, hereinafter referred to as a
                         photo-ionization cross section,
$S_S$                    is the cross section of photo-ionization for the incoherent
                         image beam with wavelength $\lambda_S$ divided by the
                         photon energy, hereinafter referred to as a
                         photo-ionization cross section,
$\beta$                  is the thermal generation rate of electrons into the
                         conduction band,
$\gamma_R$               is the carrier recombination constant,
$I_G(x,t)$               is the optical intensity profile for the coherent grating,
$I_S(x,t)$               is the optical intensity profile for the incoherent image,
$J(x,t)$                 is the current density in the crystal,
$\mu$                    is a positive number with magnitude equal to the charge
                         carrier mobility,
$k$                      is Boltzmann's constant,
$T$                      is the absolute temperature of the crystal,
$\varepsilon$            is the static dielectric constant of the crystal, and
$\varepsilon_0$          is the electric permittivity of free space.

Rationalized MKS units are assumed. The x coordinate is transverse to the
nominal light propagation direction, and parallel to the applied bias field
direction; t denotes time. Equation (7.9) is the rate equation describing the
excitation of electrons into the conduction band and subsequent recombination
into traps. Equation (7.10) states the conservation of electric charge.
Equation (7.11) is Maxwell's first equation for the electric field. Equation
(7.12) defines the current density in terms of drift and diffusion components.
The material parameters for bismuth silicon oxide assumed in the numerical
calculations are listed in Table 7.1, taken from the works of Tanguay [7.27]
and Valley and Klein [7.28]. We wish to solve the equations for the
space-charge electric field $E(x)$ which is induced by exposure to the input
optical intensities $I_G(x)$ and $I_S(x)$.

Single Grating Response. Consider first the case of a single spatial frequency
grating $I_G(x)$, as defined by (7.1) of the previous section, recorded in the
absence of an incoherent image beam $I_S(x)$. Because of the nonlinearity of
the recording process, the key variables of the photoconductivity model consist
of a superposition of harmonics of the incident light beam, i.e.,

$n(x,t) = \sum_{m=-\infty}^{+\infty} n_m(t) \exp(i m K_G x)$ ,    (7.13)

$N_D^{+}(x,t) = \sum_{m=-\infty}^{+\infty} N_{Dm}^{+}(t) \exp(i m K_G x)$ ,    (7.14)

$E(x,t) = \sum_{m=-\infty}^{+\infty} E_m(t) \exp(i m K_G x)$ , and    (7.15)

$J(x,t) = \sum_{m=-\infty}^{+\infty} J_m(t) \exp(i m K_G x)$ .    (7.16)

These  harmonic  decompositions  can  be  substituted  into  (7.9-12),  resulting  in 
a  set  of  coupled  differential  equations  defining  the  temporal  evolution  of  each 
harmonic  component.  This  coupled  set  of  equations  can  either  be  integrated 
numerically,  as  demonstrated  by  Moharam  et  al.  [7.16]  on  a  reduced  subset  of 
the  equations,  or  else  be  solved  approximately  by  perturbation  series  methods. 
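
The structure of the coupled set can be sketched as follows. This is an illustrative working, not a transcription of the authors' derivation: it assumes the standard single-species band transport form of (7.10-12) (charge continuity, Gauss's law, and a drift plus diffusion current), writes $\delta_{m0}$ for the Kronecker delta, and introduces $p$ purely as a dummy summation index.

```latex
\begin{align}
\frac{dn_m}{dt} &= \frac{dN^{+}_{Dm}}{dt} + \frac{i m K_G}{e}\, J_m , \\
i m K_G\, E_m &= \frac{e}{\varepsilon \varepsilon_0}
   \left( N^{+}_{Dm} - N^{-}_{A}\,\delta_{m0} - n_m \right) , \\
J_m &= e \mu \sum_{p=-\infty}^{+\infty} n_p\, E_{m-p}
   + i m K_G\, k T \mu\, n_m .
\end{align}
```

Each product of two spatially varying quantities, such as $nE$ in the drift current or $N_D^{+} n$ in the recombination term of (7.9), turns into a convolution sum over the harmonic index. These sums are what couple the harmonics to one another, and truncating them at some $|m| \le M$ is one way to obtain a finite system suitable for numerical integration.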

Table 7.1. Material parameters of bismuth silicon oxide (Bi12SiO20)

Parameter                    Symbol                   Value               Reference
Mobility                     $\mu$                    0.03 cm^2/(V s)     [7.28]
Carrier lifetime             $\tau$                   5 x 10^-6 s         [7.28]
Donor-like trap density      $N_D$                    10^19 cm^-3         [7.28]
Dark ionized trap density    $N_{D\mathrm{eq}}^{+}$   10^16 cm^-3         [7.28]
Recombination coefficient    $\gamma_R$               2 x 10^-11 cm^3/s   [7.28]
Dielectric constant          $\varepsilon$            56                  [7.28]

Parameter                    Symbol     488 nm   515 nm   633 nm   Units    Reference
Index of refraction          $n_0$      2.850    2.615    2.530             [7.27]
Optical absorption           $\alpha$   7.0      2.8      0.6      cm^-1    [7.27]
Electrooptic coefficient     $r_{41}$   4.52     4.51     4.41     pm/V     [7.27]
Photo-ionization cross       $S_G$               0.42              cm^2/J   [7.28]
section (divided by
photon energy)

Note: A photo-ionization cross section $S_S$ at 488 nm has been estimated to be
of order 1 cm^2/J by $S_S = (\alpha_{488}/\alpha_{515}) S_G$, assuming identical
quantum efficiency at 488 and 515 nm. However, see the discussion in Sect. 7.4.2
and Sprague [7.30].

Consider first a perturbation series expansion in powers of the modulation
depth $m_G$. Analytical expressions for the first order terms have been derived
by Kukhtarev et al. [7.18, 19], such that in the steady state regime the first
spatial harmonic component $E_1$ of the space-charge field is given by


$E_1 = -\frac{1}{2} m_G \frac{E_0 + i E_D(K_G)}{D(K_G)}$    (7.17)

to first order in the modulation depth $m_G$, in which $E_0$ is the applied
bias electric field strength. In (7.17), the denominator function $D(K_G)$ is
defined by

$D(K_G) = 1 - i \frac{E_0 + i E_D(K_G)}{E_q(K_G)}$    (7.18)

and the diffusion field $E_D(K_G)$ and the trap-limited saturation field
strength $E_q(K_G)$ are defined by

$E_D = \frac{k T K_G}{e}$    (7.19)

$E_q = \frac{e N_A^{-}}{\varepsilon \varepsilon_0 K_G}$    (7.20)

The trap-limited saturation field strength $E_q$ defines the highest electric
field that can be generated by a sinusoidal charge distribution with maximum
charge density of $e N_{D\mathrm{eq}}^{+} = e N_A^{-}$. For reduced values of
$N_A^{-}$ (and hence of $N_{D\mathrm{eq}}^{+}$), the amount of space charge and
resulting space-charge field strength is limited as described by (7.17, 18).
Considerable variation has been reported in the literature in estimates of the
equilibrium donor-like trap density $N_{D\mathrm{eq}}^{+}$ for bismuth silicon
oxide [7.29], which directly affects the estimate of the saturation field
strength $E_q$, and hence the upper limit on the space-charge field strength.
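
To get a feel for the magnitudes involved, the following minimal Python sketch evaluates the diffusion field, the saturation field, and the first-order field. It assumes the forms $E_D = kTK_G/e$, $E_q = eN_A^{-}/(\varepsilon\varepsilon_0 K_G)$, $D = 1 - i(E_0 + iE_D)/E_q$ and $E_1 = -\tfrac{1}{2} m_G (E_0 + iE_D)/D$. The function names, the room-temperature default of 300 K, the 6 kV/cm bias, and the default material values (a trap density of 10^16 cm^-3 and a dielectric constant of 56) are illustrative assumptions, not values prescribed by the text.

```python
import math

# Physical constants in SI units.
E_CHARGE = 1.602176634e-19    # electronic charge, C
K_BOLTZ = 1.380649e-23        # Boltzmann constant, J/K
EPS0 = 8.8541878128e-12       # permittivity of free space, F/m


def characteristic_fields(cycles_per_mm, n_acceptor_cm3=1e16, eps_r=56.0, temp_k=300.0):
    """Return (E_D, E_q) in V/m for a grating of the given spatial frequency.

    The defaults are assumed, illustrative values for bismuth silicon oxide.
    """
    k_g = 2.0 * math.pi * cycles_per_mm * 1.0e3     # grating wave number, rad/m
    n_a = n_acceptor_cm3 * 1.0e6                    # convert cm^-3 to m^-3
    e_d = K_BOLTZ * temp_k * k_g / E_CHARGE         # diffusion field
    e_q = E_CHARGE * n_a / (eps_r * EPS0 * k_g)     # trap-limited saturation field
    return e_d, e_q


def first_harmonic_field(m_g, e0, cycles_per_mm, **material):
    """First-order steady-state field E_1 (complex, V/m) for bias field e0 in V/m."""
    e_d, e_q = characteristic_fields(cycles_per_mm, **material)
    drive = e0 + 1j * e_d                 # numerator E_0 + i E_D
    denom = 1.0 - 1j * drive / e_q        # denominator function D
    return -0.5 * m_g * drive / denom


if __name__ == "__main__":
    bias = 6.0e5  # assumed applied field: 6 kV/cm expressed in V/m
    for freq in (10, 100, 300, 1000):
        e_d, e_q = characteristic_fields(freq)
        e1 = first_harmonic_field(0.1, bias, freq)
        print(f"{freq:5d} cycles/mm  E_D={e_d / 1e5:7.3f} kV/cm  "
              f"E_q={e_q / 1e5:8.2f} kV/cm  |E_1|={abs(e1) / 1e5:6.3f} kV/cm")
```

Because $E_q$ falls as $1/K_G$ while $E_D$ rises as $K_G$, the printed table makes it easy to see at which spatial frequencies the saturation field starts to limit the response for a chosen trap density.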

The first order perturbation analysis is accurate only in the limit of low
modulation depths $m_G$, whereas many gratings are written with the highest
possible modulation depths. Solutions accurate at higher modulation depths can
be obtained by numerical methods, with sample solutions presented in Fig. 7.11
showing the space-charge field profiles induced by a single grating frequency
in the steady state regime for various applied bias fields and various grating
frequencies. A modulation depth of $m_G = 0.99$ and the material parameters
listed in Table 7.1 are assumed for all curves. Note that even when the writing
light profile is cosinusoidal, the resulting space-charge field profiles
exhibit significant distortion because of the nonlinearity of the recording
process.

In practice, only one of the multiple spatial harmonics of the coherent grating
wave vector $K_G$ can dominate the optical readout process because the grating
is typically recorded very deeply into the Bragg regime. Figure 7.12 therefore
shows numerical solutions for the strength of just the first spatial harmonic
component of the space-charge field $E_1$ as a function of the modulation depth
$m_G$ (labeled in the figure as $m_G^{\mathrm{eff}}$ in anticipation of the two
grating transcription discussion that follows, but to be interpreted for now as
$m_G$). The family of curves shown in Fig. 7.12 is parametrized by the ratio
$E_q/E_0$, which scales the relative magnitudes of the trap-limited saturation
field to the applied bias field. Those portions of the response curves in
Fig. 7.12 that can be approximated by a straight line, namely for low levels of
the modulation depth $m_G$ up to about 0.5 or so, are the regions in which the
linear approximation of Kukhtarev et al. [7.18, 19] most accurately describes
the transcription process. In anticipation of the discussion of nonlinear PICOC
response given in Sect. 7.4.2, we find that high effective modulation depths
$m_G^{\mathrm{eff}}$ (defined by (7.23) below) for which the linear
approximation breaks down are associated with weak levels of average incoherent
image intensity $I_1$ compared with the average coherent grating light
intensity $I_0$, while low modulation depths for which the linear approximation
is most accurate occur for strong levels of average incoherent light
illumination $I_1$.

Fig. 7.12. First harmonic component of the space-charge field as a function of
the modulation depth of the coherent carrier grating for various levels of the
equilibrium donor-like trap density

Two Grating Transcription. When an incoherent image beam is present
simultaneously with the coherent grating, as in the simultaneous
erasure/writing mode (SEWM), then additional terms corresponding to harmonics
of the image frequencies must be included in the space-charge field. Consider
for example an image consisting of just one sinusoidal component, as given by
(7.2), with a resulting space-charge field decomposition of the form of (7.3).
Double harmonic decompositions analogous to (7.3) can be assumed for each of
the variables in the photoconductive equations (i.e., the ionized trap density
$N_D^{+}$, the electron density $n$, and the current density $J$).

The resulting set of coupled differential equations can be solved either
numerically or by a perturbation series analysis with respect to this harmonic
decomposition, with the latter being the approach adopted herein [7.22]. The
perturbation series approach leads to analytical expressions for the linear
terms $E_{10}$ and $E_{01}$ of the form

$E_{10} = -\frac{1}{2} m_G \frac{S_G I_0}{S_G I_0 + S_S I_1} \, \frac{E_0 + i E_D(K_G)}{D(K_G)}$    (7.21)

$E_{01} = -\frac{1}{2} m_S \frac{S_S I_1}{S_G I_0 + S_S I_1} \, \frac{E_0 + i E_D(K_S)}{D(K_S)}$    (7.22)

as given in Appendix 7.A and detailed in [7.22]. These expressions correspond
very closely to (7.17), the first order field expression derived by Kukhtarev
et al. [7.18, 19]. The effective modulation depths $m_G^{\mathrm{eff}}$ and
$m_S^{\mathrm{eff}}$ in (7.21, 22), which account for the reduction of the
original modulation depths $m_G$ and $m_S$ by the presence of both the
incoherent image and the coherent grating beams, are defined by

$m_G^{\mathrm{eff}} = m_G \frac{S_G I_0}{S_G I_0 + S_S I_1}$    (7.23)

$m_S^{\mathrm{eff}} = m_S \frac{S_S I_1}{S_G I_0 + S_S I_1}$    (7.24)

The expressions (7.21, 22) can be simplified over broad operating regions as
follows. For spatial frequencies less than of order 200 cycles/mm and bias
fields over 2 kV/cm, and assuming the equilibrium donor-like trap density
$N_{D\mathrm{eq}}^{+}$ given in Table 7.1, the denominator terms $D(K_G)$ and
$D(K_S)$ can be approximated by unity, and the diffusion field $E_D$ can be
neglected in favor of the applied bias field. For higher spatial frequencies,
the denominator factors $D(K_G)$ and $D(K_S)$ and the diffusion field $E_D$
primarily contribute a phase shift to the coherent grating's charge
distribution, with negligible degradation of its magnitude. (This assertion is
justified in the discussion on material limitations in Sect. 7.4.3.) Thus the
lowest harmonics of the space-charge field can be approximated by

$E_{10} = -\frac{1}{2} m_G^{\mathrm{eff}} E_0$    (7.25)

$E_{01} = -\frac{1}{2} m_S^{\mathrm{eff}} E_0$    (7.26)

By invoking similar approximations, the magnitude of the intermodulation term
$E_{11}$, which is crucial to the incoherent-to-coherent optical conversion
process, is well approximated by

En  =  .  (7.27) 

as discussed in Sect. 7.4.3 and in [7.22]. An identical expression for $E_{11}$
was derived by Marrakchi et al. [7.8], starting from the constant recombination
time approximation discussed in Sect. 7.3.1. This derivation involves an
additional linearization of the response in the limit of low modulation depths
$m_G^{\mathrm{eff}}$ and $m_S^{\mathrm{eff}}$.

In addition to the magnitude expressed by (7.27), the perturbation series
analysis predicts that the $E_{11}$ field is phase shifted with respect to the
incident coherent grating. For steady state response in SEWM, this phase shift
does not significantly impact the performance of the conversion process,
assuming that it remains reasonably constant over the recording bandwidth; for
temporal response it can significantly degrade the usefulness of PICOC for
particular optical processing architectures.

Equations (7.25-27) indicate that the SEWM response in the steady state limit
is predominantly governed by the reduced modulation depths $m_G^{\mathrm{eff}}$
and $m_S^{\mathrm{eff}}$. The consequent impact on the overall readout image
light intensity and on the modulation transfer characteristic is discussed in
the nonlinear transfer response analysis, presented next.


7.4.2  Nonlinear  Transfer  Response 

The image transfer for PICOC involves a performance trade-off between two
competing mechanisms: the contrast ratio (which involves the ratio of $I_{11}$
and $I_{10}$) improves steadily with increasing intensities of incoherent
image-bearing light (in the absence of spatial filtering), but at the same time
the average intensity (which involves $I_{10}$ alone) steadily declines with
increasing incoherent intensity because the uniform background in the
incoherent light erases the carrier grating pattern in the photorefractive
crystal.

In many optical information processing applications, optimization of the image
contrast ratio is desirable within the image intensity constraints implied by
the performance trade-off described above. In other types of signal processing
applications such as correlation with a Vander Lugt filter, the dc image
content contained in the $I_{10}$ diffraction component does not contribute to
the processing, or can be readily modified by spatial filtering, whereas
maximizing the intensity of nonzero spatial frequency components such as
$I_{11}$ is critical to good conversion performance. For these cases, an
optimum level of incoherent image-bearing beam intensity exists for maximizing
the $I_{11}$ component, beyond which the $I_{11}$ component decays because of
space-charge erasure. Therefore the following study of the nonlinear transfer
response includes explicit consideration of the $I_{11}$ diffraction order
term.

To obtain a quantitative estimate of these effects, consider as a
representative example the simultaneous erasure/writing mode (SEWM) in the
steady state regime, with a single spatial frequency coherent grating given by
(7.1), and a single spatial frequency incoherent image profile given by (7.2).
One possible method of assessing the image transfer response is to determine
the modulation depth of the readout image as a function of the input
(incoherent and coherent) light characteristics. A convenient parameter that
describes these characteristics is the product $BR$, in which $R = I_1/I_0$ is
the ratio of the average incoherent image light to the average coherent grating
light intensity levels, and $B = S_S/S_G$ is the ratio of the photoconductive
sensitivity of the incoherent image beam with respect to the coherent grating
beam. With these definitions, the effective modulation depths
$m_G^{\mathrm{eff}}$ and $m_S^{\mathrm{eff}}$ defined by (7.23, 24) can be
expressed as

$m_G^{\mathrm{eff}} = \frac{m_G}{1 + BR}$  and    (7.28)

$m_S^{\mathrm{eff}} = \frac{m_S BR}{1 + BR}$    (7.29)

Fig. 7.13. Normalised diffraction efficiency of the $I_{10}$ beam (uniform
erasure), and the modulation transfer function for modulated erasure, as a
function of the intensity ratio R

The diffracted intensity $I_{10}$ is directly proportional to the square of
$m_G^{\mathrm{eff}}$, while the modulation depth of the readout image is
similarly proportional to $m_S^{\mathrm{eff}}$. The image transfer response is
plotted in Fig. 7.13, which shows the improvement in modulation depth of the
readout light beam with increasing levels of incoherent image illumination, but
with concomitant decay of the average transferred image intensity $I_{10}$.
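
The trade-off can be tabulated with a few lines of Python. This is a minimal sketch under stated assumptions: it takes the effective modulation depths to be $m_G/(1+BR)$ and $m_S BR/(1+BR)$, normalizes the carrier intensity to its value at $R = 0$, and uses the effective image modulation depth directly as the readout modulation measure (all proportionality constants are dropped). The function names and the sample value $B = 2$ are illustrative choices.

```python
def effective_depths_br(m_g, m_s, b, r):
    """Effective modulation depths for sensitivity ratio b and intensity ratio r."""
    br = b * r
    return m_g / (1.0 + br), m_s * br / (1.0 + br)


def transfer_curves(b, ratios, m_g=1.0, m_s=1.0):
    """Return (R, normalized I_10, readout modulation measure) for each ratio R."""
    rows = []
    for r in ratios:
        mg_eff, ms_eff = effective_depths_br(m_g, m_s, b, r)
        i10_norm = (mg_eff / m_g) ** 2    # carrier intensity relative to R = 0
        rows.append((r, i10_norm, ms_eff))
    return rows


if __name__ == "__main__":
    for r, i10, mod in transfer_curves(2.0, [0.0, 0.1, 0.25, 0.5, 1.0, 2.0, 5.0]):
        print(f"R={r:4.2f}  I10/I10(0)={i10:6.4f}  modulation measure={mod:6.4f}")
```

The two output columns move in opposite directions as R grows, which is the competition between contrast and average intensity described in the text.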

An unusual but intuitive feature of the photorefractive incoherent-to-coherent
conversion process (as opposed to more typical linear spatial light modulation
techniques) is that the image quality is primarily determined by the ratio of
the incoherent image intensity to the coherent grating intensity, not by the
sum of these intensities (assuming negligible dark conductivity $\beta$ in the
photogeneration rate equation (7.9)). Remember, however, that this analysis
assumes a recording process that has already reached steady state, and hence
does not address the question of the time required to reach saturation, which
is indeed a function of the total light intensity. Temporal response issues are
considered in Sect. 7.4.4 below.
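
The ratio-only dependence can be checked numerically. In the sketch below, the case `beta = 0` follows the effective modulation depth definitions in terms of $S_G I_0$ and $S_S I_1$. Adding the thermal generation rate to the denominator is an assumption made here for illustration: it treats dark generation as one more uniform contribution to the total generation rate in (7.9), and is not a formula taken from the text. The function name and the sample values are likewise hypothetical.

```python
def depths_from_intensities(m_g, m_s, s_g, s_s, i0, i1, beta=0.0):
    """Effective modulation depths from average intensities.

    s_g, s_s : photo-ionization cross sections divided by photon energy
    i0, i1   : average grating and image intensities (units consistent with s)
    beta     : thermal generation rate; its placement here is an assumption
    """
    total = s_g * i0 + s_s * i1 + beta    # total uniform generation rate
    return m_g * s_g * i0 / total, m_s * s_s * i1 / total


if __name__ == "__main__":
    weak = depths_from_intensities(0.9, 0.5, 0.42, 1.0, 1e-3, 1e-3)
    strong = depths_from_intensities(0.9, 0.5, 0.42, 1.0, 1e-1, 1e-1)
    print("no dark generation:  ", weak, strong)   # same pair up to rounding

    weak_dark = depths_from_intensities(0.9, 0.5, 0.42, 1.0, 1e-3, 1e-3, beta=1e-3)
    strong_dark = depths_from_intensities(0.9, 0.5, 0.42, 1.0, 1e-1, 1e-1, beta=1e-3)
    print("with dark generation:", weak_dark, strong_dark)
```

With `beta = 0`, scaling both intensities by the same factor leaves the result unchanged. A nonzero `beta` breaks this invariance at low light levels, which is why the text qualifies the statement with the assumption of negligible dark conductivity.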

To test the predictions of this nonlinear transfer model, the erasure of the
readout beam's first diffraction order $I_{10}$ in response to spatially
uniform incoherent illumination has been measured by the nondegenerate four
wave mixing configuration discussed in Sect. 7.2, with results as shown in
Fig. 7.14. The grating was written with the 515 nm line of an argon ion laser,
while the 488 nm line was used to simulate the spatially uniform image beam.
For comparison, predictions of two different models of the conversion process
are included in Fig. 7.14. The curve labeled "linear approximation" corresponds
to the perturbation analysis term $E_{10}$ presented in this chapter, while the
second curve labeled "constant recombination time approximation" derives from
(7.9) of the previously published model [7.8], which in turn is based upon the
saturation regime model of Moharam et al. [7.16]. The linear approximation
model has been scaled in intensity to match the experimental points at
$R \approx 0.6$, and the constant recombination time approximation (Moharam's
analytical solution) has been scaled to converge with the linear approximation
for very large beam ratios R.


Fig. 7.14. Experimental diffraction efficiency of the $I_{10}$ beam (uniform
erasure) as a function of the intensity ratio R. Corresponding theoretical
predictions for the linearised model and the constant recombination time model
are also shown for comparison

Note that the experimental data points and both models converge reasonably well
for high levels of incoherent image illumination, which correspond to large
intensity ratios R that by (7.28) imply small effective modulation depths
$m_G^{\mathrm{eff}}$. Recall from Fig. 7.12 that small modulation depths
correspond to the most accurate region of the linearized models of the
recording process, as assumed in the lowest order terms of the perturbation
analysis (as well as in the linearized constant recombination time
approximation discussed following (7.27), which was shown to yield expressions
identical to the perturbation analysis). In contrast, for small intensity
ratios R with concomitantly large effective modulation depths
$m_G^{\mathrm{eff}}$, the full nonlinear constant recombination time model as
developed in [7.8] offers a significantly better recording approximation than
the linearized models for the simultaneous erasure/writing mode (SEWM) in
saturation, as shown clearly in Fig. 7.14 by the curve marked "constant
recombination time approximation". Note in addition that the choice of scaling
of the constant recombination time approximation does not automatically
guarantee good agreement with the data near R = 0.

A more challenging test of the theory is to predict accurately the conversion
response to a sinusoidally modulated image beam. Such a test can be performed
with the nondegenerate four wave mixing geometry described previously in which
a 488 nm argon ion laser line passes through a Michelson interferometer to
generate a sinusoidal spatial modulation, as shown in Fig. 7.15. The sinusoidal
image modulation introduces an additional diffraction order $I_{11}$ not
observed in the uniform erasure case. The diffracted light sideband intensity
$I_{11}$ has been measured as a function of the intensity ratio R, with results
as shown in Fig. 7.16, and as compared against the theoretical prediction of
the perturbation expansion method.

Fig. 7.15. Experimental arrangement for sensitivity and transfer function
measurements, as described in detail in the text

Fig. 7.16. Normalized experimental diffraction efficiency of the $I_{11}$ beam
(strongly modulated erasure) as a function of the intensity ratio R. The
corresponding theoretical prediction is also shown for comparison

The general features of Fig. 7.16 can be understood with reference to
Fig. 7.13. For low levels of image intensity, corresponding to low intensity
ratios R, the coherent grating's charge pattern is not significantly erased by
the image beam, as indicated by the high level of $I_{10}$ in Fig. 7.13. In
this regime, increasing the incoherent image-bearing light intensity increases
the transfer of image modulation onto the space-charge grating profile without
destroying that profile. For high levels of incoherent light intensity,
corresponding to larger intensity ratios R, the transfer of image modulation
onto the coherent grating's space-charge field profile is very high as shown by
the readout beam's modulation depth curve in Fig. 7.13, but the high intensity
of the uniform incoherent image light $I_1$ strongly erases the coherent
grating's space-charge profile, as shown by the $I_{10}$ curve in Fig. 7.13.
Hence the $I_{11}$ diffracted light component, which is derived from the
combination of the modulation depth and the average grating intensity $I_{10}$,
exhibits the peaking behavior shown in Fig. 7.16.
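
The location of the peak can be estimated with a short calculation. The sketch below assumes that the sideband field scales as the product of the two effective modulation depths, so that $I_{11} \propto (m_G^{\mathrm{eff}} m_S^{\mathrm{eff}})^2 \propto (BR)^2/(1+BR)^4$. This scaling is consistent with the qualitative argument above but is an assumption of the example, and all prefactors are dropped. Setting the derivative with respect to $BR$ to zero gives a maximum at $BR = 1$; the code confirms this by a simple grid search. The function names, grid spacing and the value $B = 2$ are illustrative.

```python
def sideband_intensity(br, m_g=1.0, m_s=1.0):
    """Relative I_11 (arbitrary units) for a given product BR."""
    mg_eff = m_g / (1.0 + br)
    ms_eff = m_s * br / (1.0 + br)
    return (mg_eff * ms_eff) ** 2


def peak_br(step=0.001, br_max=5.0):
    """Grid-search estimate of the BR value that maximizes the sideband intensity."""
    n = int(round(br_max / step))
    grid = [i * step for i in range(1, n + 1)]
    return max(grid, key=sideband_intensity)


if __name__ == "__main__":
    b = 2.0                      # assumed photoconductive sensitivity ratio
    br_star = peak_br()
    print(f"peak at BR = {br_star:.3f}, i.e. R = {br_star / b:.3f} for B = {b}")
```

Under these assumptions the predicted optimum sits at $R = 1/B$, so any uncertainty in the sensitivity ratio B translates directly into uncertainty in the predicted peak position.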

The match between the theoretical curve in Fig. 7.16 and the experiment is not
expected to be perfect because the perturbation theory requires small
modulation depths $m_G$ and $m_S$ for reasonable accuracy, whereas the
experiment is configured with the highest possible modulation depths. To impose
small modulation depths in the experiment would make the diffracted $I_{11}$
light intensity too weak to measure reliably, so a more exacting test of the
theory must await numerical modeling of high modulation depth transcriptions.

An additional difficulty in achieving agreement between the theoretical models
and the experiment concerns the appropriate value for the photoconductive
sensitivity ratio B. For the experiment described in Fig. 7.14, in which the
coherent writing wavelength is 515 nm, and the incoherent image wavelength is
488 nm, an estimate of B = 2 can be derived from a single photon absorption
model that assumes the quantum efficiencies of the photoconductive processes
for both the coherent grating beam and the incoherent image beam to be
identical [7.8]. However, measurements of the photoconductive quantum
efficiency reported by Sprague [7.30] show a strong dependence on the light
wavelength, such that an increase in the sensitivity ratio B by a factor of 1.5
can reasonably be argued due to this dispersion of quantum efficiency.
Furthermore, this factor can be expected to vary from one crystal sample to
another, depending upon the detailed growth conditions. On the other hand, a
reduction of the effective sensitivity ratio B by a factor of as much as 1.7
can also be argued for the crystal thickness used in this experiment because of
the dispersion of the optical absorption, i.e., the ratio of coherent grating
to incoherent image beam intensities changes continuously as both beams
propagate through the crystal [7.8]. Because of these conflicting arguments,
the nominal ratio of B = 2 in Figs. 7.14 and 7.16 has been assumed for the
wavelengths considered. This gives a good fit for the uniform erasure $I_{10}$
experimental points, especially for the constant recombination time
approximation, but this ratio causes the predicted modulated erasure $I_{11}$
curve to peak at a significantly lower intensity ratio R than is indicated by
experiment.

In summary, the broad features of the recording nonlinearities are well
understood and successfully modeled by the perturbation series approach, e.g.,
the decay of the coherent grating pattern with increasing amounts of image
intensity, and the dependence of the $I_{11}$ diffracted light intensity
component on exposure parameters. Detailed agreement will require both further
analysis to model the nonlinearities more accurately, and better information
about the wavelength dispersion of the photoconductive sensitivities of the
coherent grating and the image-bearing beams.

7.4.3  Spatial  Resolution  Issues  for  the  Recording  Process 

A  number  of  distinct  factors  influence  the  ultimate  resolution  achievable  with 
the  PICOC  spatial  light  modulator.  These  factors  can  be  classified  as  geometric, 
configurational,  and  materials  related  in  nature.  The  geometric  and  materials 
related  factors  influence  the  recording  of  the  image,  and  hence  are  reviewed  in 
this  section.  The  configurational  factors  influence  the  readout  of  the  volume 
gratings,  and  as  such  are  reviewed  in  the  subsequent  section. 

Geometrical Limitations. Geometric resolution limitations derive principally
from the incorporation of an incoherent imaging system in the four-wave mixing
geometry, and from the finite crystal thickness $d$ required to create a volume
holographic grating. These effects are illustrated in Figs. 7.17 and 7.18.
Distinctly different resolution performance is expected, depending upon the
optical absorption coefficients $\alpha_G$ of the coherent grating light and
$\alpha_S$ of the incoherent image light.

Figure 7.17 describes the case for low optical absorption ($\alpha_G d < 1$ and
$\alpha_S d < 1$), such that the induced holographic grating has essentially
uniform amplitude throughout the volume of the crystal. As can be expected from
physical considerations, the optimum focal point occurs in the center of the
crystal, and is not localized on the front surface of the crystal. The spatial
frequency response will then be proportional to $(W/2)^{-1}$, which in turn is
equal to $4 n_0 F_\# / d$ for the case of 1:1 imaging, in which $W$ is the diameter
of the incoherent image beam at the front surface of the crystal (as shown in
Fig. 7.17), $n_0$ is the refractive index of the electrooptic crystal, $F_\#$ is the
F-number of the incoherent imaging system, and $d$ is the crystal thickness. For
example, for $n_0 = 2.5$, $d = 1$ mm, and an F-number of 5, the resolution limit
is approximately 50 cycles/mm.

Fig. 7.17. Geometrical constraints imposed on PICOC resolution by the finite
F-number of the image input optics for the case $\alpha d < 1$ (Case I), for which
$R = (W/2)^{-1} = 4 n F_\# / d$. A distance of $2f$ between the imaging lens and
the photorefractive crystal has been assumed in this and in the next figure,
corresponding to a presumed 1:1 lens magnification. Example annotated in the
figure: for $n = 2.5$ and $F_\# = 5$, $R \approx 50$ line pairs/mm.

Fig. 7.18. Geometrical constraints imposed on PICOC resolution by the finite
F-number of the image input optics for the case $\alpha d > 1$ (figure annotation:
$\alpha = 100$ cm$^{-1}$).

In contrast, Fig. 7.18 describes the case for which $\alpha_G d > 1$, such that the
induced holographic grating has significant amplitude only within a thin layer
of thickness $d_{\mathrm{eff}} = \alpha_G^{-1}$. In this case, the resolution is given
by $4 n_0 F_\# \alpha_G$. For $n_0 = 2.5$, $d = 1$ mm, $\alpha_G = 100$ cm$^{-1}$, and
an F-number of 5, the resolution limit is approximately 500 cycles/mm, a factor
of 10 improvement in resolution. However, the diffraction efficiency is reduced
by a factor of order 100 because of the reduction in effective thickness of the
grating. If the absorption coefficient $\alpha_S$ is chosen to be significantly
larger than $\alpha_G$, then the resolution will be constrained by $\alpha_S$
instead of $\alpha_G$.
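The two geometric limits quoted above can be checked with a few lines of
arithmetic. The sketch below is illustrative only: the function names are
hypothetical, and the quadratic dependence of diffraction efficiency on grating
thickness is an assumption (weak-grating limit) chosen because it reproduces
the factor of order 100 stated in the text.

```python
def resolution_low_absorption(n0, f_number, thickness_mm):
    """Geometric limit R = 4 n0 F# / d in cycles/mm (case alpha*d < 1)."""
    return 4.0 * n0 * f_number / thickness_mm


def resolution_high_absorption(n0, f_number, alpha_per_cm):
    """Geometric limit R = 4 n0 F# alpha in cycles/mm (case alpha*d > 1)."""
    alpha_per_mm = alpha_per_cm / 10.0  # 1 cm^-1 = 0.1 mm^-1
    return 4.0 * n0 * f_number * alpha_per_mm


# Parameter values taken from the worked examples in the text.
n0, f_number, thickness_mm, alpha_per_cm = 2.5, 5.0, 1.0, 100.0

r_low = resolution_low_absorption(n0, f_number, thickness_mm)
r_high = resolution_high_absorption(n0, f_number, alpha_per_cm)

# Effective grating thickness 1/alpha, converted from cm to mm.
d_eff_mm = 10.0 / alpha_per_cm

# Assumption: diffraction efficiency scales as the square of the grating
# thickness (weak-grating limit), so thinning the grating costs efficiency.
efficiency_penalty = (thickness_mm / d_eff_mm) ** 2

print(f"low absorption:  {r_low:.0f} cycles/mm")
print(f"high absorption: {r_high:.0f} cycles/mm")
print(f"efficiency penalty: about {efficiency_penalty:.0f}x")
```

The tenfold gain in resolution and the hundredfold loss in efficiency both
follow from the same tenfold reduction in effective grating thickness.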

Material Limitations. An additional resolution limitation stems from
material-dependent parameter constraints which influence the physics of grating
formation, in particular the finite supply of compensating traps, which is equal
to the equilibrium donor-like trap density $N_D^+$. If the trap density is limited,
then the space-charge field that can be recorded is similarly limited because
sufficient space charge cannot be generated to establish any higher field strengths.
This limitation becomes progressively more severe at higher spatial frequencies
as expressed by the $D(K_G)$ factor in (7.17), in which the trap-limited
saturation space-charge field $E_q$ is a function of the spatial frequency and the
equilibrium donor-like trap density $N_D^+$ as shown in (7.20). The reduction
in space-charge field with an increase in spatial frequency (as exhibited by a
corresponding decrease in $E_q$) is shown in Fig. 7.12. If an unlimited supply of
compensating traps were available for establishing the space-charge field, then
the saturation field $E_q$ would be infinite and the factor $D(K_G)$ would converge
to unity, indicating negligible degradation in the space-charge field. In practice,
the finite level of compensating traps leads both to a reduction of the
space-charge field and to a phase shift as consequences of the finite saturation
field. These two effects are discussed in more detail below.

Consider for example an equilibrium donor-like trap density $N_D^+$ of $10^{16}$
cm$^{-3}$ and a coherent grating frequency of 300 cycles/mm, which implies a
saturation field $E_q$ of order 19 kV/cm and a diffusion field $E_D$ of order
0.5 kV/cm. For an applied bias field of 6 kV/cm, the effect of the factor
$D(K_G)$ in (7.17) is only a 5% reduction in the magnitude of the induced
space-charge field in the linear approximation. As these parameters are typical
of PICOC operation, the resultant effect induced by this mechanism on the
spatial frequency response is therefore negligible compared with alternative
response degradation mechanisms such as Bragg detuning on readout, as discussed
in the next section.
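As a rough check on the field magnitudes quoted above, the following sketch
evaluates the diffusion field and the trap-limited saturation field. It assumes
the standard band-transport forms $E_D = k_B T K / e$ and
$E_q = e N / (\varepsilon K)$; the chapter's own definitions (7.19) and (7.20)
are not reproduced in this section, so these forms, the temperature of 300 K,
and the relative permittivity of 56 (a commonly quoted value for bismuth silicon
oxide, not taken from Table 7.1) are all assumptions.

```python
import math

E_CHARGE = 1.602176634e-19      # elementary charge, C
K_BOLTZMANN = 1.380649e-23      # Boltzmann constant, J/K
EPS0 = 8.8541878128e-12         # vacuum permittivity, F/m
V_PER_M_PER_KV_PER_CM = 1.0e5   # 1 kV/cm expressed in V/m


def grating_wavenumber(cycles_per_mm):
    """Grating wave number K in rad/m for a spatial frequency in cycles/mm."""
    return 2.0 * math.pi * cycles_per_mm * 1.0e3


def diffusion_field_kv_cm(wavenumber, temperature_k=300.0):
    """Assumed form E_D = k_B T K / e, returned in kV/cm."""
    field_v_per_m = K_BOLTZMANN * temperature_k * wavenumber / E_CHARGE
    return field_v_per_m / V_PER_M_PER_KV_PER_CM


def saturation_field_kv_cm(wavenumber, trap_density_cm3, eps_r):
    """Assumed form E_q = e N / (eps_r eps0 K), returned in kV/cm."""
    trap_density_m3 = trap_density_cm3 * 1.0e6
    field_v_per_m = E_CHARGE * trap_density_m3 / (eps_r * EPS0 * wavenumber)
    return field_v_per_m / V_PER_M_PER_KV_PER_CM


k_g = grating_wavenumber(300.0)
e_d = diffusion_field_kv_cm(k_g)
# eps_r = 56 is an assumed value, not the Table 7.1 entry.
e_q = saturation_field_kv_cm(k_g, trap_density_cm3=1.0e16, eps_r=56.0)

print(f"E_D = {e_d:.2f} kV/cm")
print(f"E_q = {e_q:.1f} kV/cm")
```

With these assumed constants the diffusion field comes out close to the quoted
0.5 kV/cm, and the saturation field lands within roughly 10% of the quoted
19 kV/cm; the residual difference depends on the permittivity adopted.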

The addition of harmonic components to the image will in general introduce
an additional amplitude variation in the space-charge field through the
denominator factor $D(K)$ in (7.17) (as shown explicitly in (7.A4) of Appendix
7.A). This variation is negligible compared with the effect induced by the
coherent grating spatial frequency described above.

To verify the predicted high bandwidth of the recording process and to
eliminate depth of focus issues as discussed previously, the Michelson
interferometer configuration shown in Fig. 7.15 was used to record sinusoidal image
patterns onto a bismuth silicon oxide crystal. The ratio $R$ of the image-bearing
light intensity $I_1$ to the coherent grating light intensity $I_0$ was adjusted
experimentally to maximize the intensity of the $I_{11}$ diffraction order. As the
image grating frequency was varied, the angular alignment of the readout beam was
also varied to maintain optimum Bragg angle alignment, thereby removing
Bragg detuning effects on the readout resolution. Thus the measured diffracted
intensity $I_{11}$ corresponds to the strength of the space-charge field stored in
the crystal, with results as shown in Fig. 7.19. A slight decrease in diffraction
efficiency with increasing spatial frequency is observed, but the effective
bandwidth for grating writing far exceeds the expected bandwidths for Bragg
detuning on readout, as discussed in the readout section below.

A far more serious implication of (7.17) for the space-charge field component
$E_1$ is a shift in the phase of the recorded space-charge field profile with
respect to the incident coherent grating illumination profile. Considering again
an equilibrium donor-like trap density $N_D^+$ of $10^{16}$ cm$^{-3}$, a coherent
grating frequency of 300 cycles/mm, and an applied bias field of 6 kV/cm, the
resultant phase shift is of order 18°. This phase shift poses no problems for PICOC
performance so long as it remains uniform over the full aperture of the
photorefractive crystal and the full bandwidth, and it in fact varies only weakly
with $K_S$ and $m_S$. On the other hand, an image-induced differential phase shift
can prove to be problematic for particular optical processing architectures.

Fig. 7.19. Measured resolution response of the photorefractive recording
process, limited by material dependent constraints. These measurements were
performed by continuously Bragg matching the $I_{11}$ order, thereby removing any
dependence of bandwidth on Bragg detuning, as discussed in Sect. 7.3.

Resolution Anisotropy. A final resolution issue that has been predicted by the
perturbation series is a moderate anisotropy in the recording of image structures
parallel as opposed to perpendicular to the applied bias field for the
simultaneous erasure/writing mode (SEWM), and a severe resolution anisotropy for the
grating inhibition mode (GIM). Features of this analysis are quite intriguing
and hence are briefly reviewed here.

The perturbation series analysis is conducive to a physically intuitive
interpretation of the perturbation terms as a sequence of discrete recording events.
For the simultaneous erasure/writing mode (SEWM), two recording paths
contribute to the $I_{11}$ diffraction order. In one recording path, the incoherent
image writes a space-charge field $E_{01}$, independent of the coherent grating
profile. This space-charge field $E_{01}$ then modulates the recording of the
coherent grating light beam. This transcription path is analogous to the grating
inhibition mode (GIM). In the second SEWM transcription path, the coherent grating
profile writes a space-charge field $E_{10}$, which then modulates the recording of
the incoherent image-bearing beam. This second path is analogous to the
grating erasure mode (GEM). The GEM-like path exhibits nearly perfect isotropy
of response to an arbitrary image, but the GIM-like path exhibits a very strong
anisotropy, with image structures oriented perpendicular to the applied bias
field generating much weaker space-charge fields than structures oriented
parallel to the bias field.

To elaborate, consider an image profile consisting of a single spatial
frequency, similar to (7.2), but oriented such that the wave vector $K_S$ is
orthogonal to the applied bias field $E_0$. Thus the incident image intensity
$I_S(y)$ is given by

$$I_S(y) = I_1 (1 + m_S \cos K_S y) , \qquad (7.30)$$

in which the $y$ axis is orthogonal to the applied bias field $E_0$. This image
profile, in combination with the coherent grating profile as given by (7.1),
induces a space-charge field $\mathbf{E}(x, y)$ of the form

$$\mathbf{E}(x, y) = \hat{x} E_{10} \cos K_G x + \hat{y} E_{01} \cos K_S y
+ (\hat{x} E_{11x} + \hat{y} E_{11y}) \cos K_G x \cos K_S y
+ \text{higher order terms} \qquad (7.31)$$

in which $\hat{x}$ and $\hat{y}$ are unit vectors parallel and perpendicular to the
applied bias field respectively.

Terms such as $E_{01}$ and $E_{11y}$ involve recording a charge pattern along
a direction orthogonal to the applied bias field, and hence do not benefit
from the enhancement of the photoconductivity induced by this applied field.
In practice, these terms are very much smaller than terms such as $E_{10}$ and
$E_{11x}$ which involve recording a charge pattern along a direction parallel to
the applied bias field. Analysis using the perturbation method shows that the
$E_{11x}$ term is reduced by a factor of two compared with the $E_{11}$ term given
by (7.5) that results when the image wave vector $K_S$ is parallel to the applied
bias field $E_0$.

Thus we find that the material limitations on recording resolution are
negligible compared with the geometrical limitations, which in turn are of
comparable magnitude to the Bragg sensitivities on readout to be discussed in
Sect. 7.5.

7.4.4  Temporal  Response 

A temporal response analysis is necessary for the study of the grating inhibition
mode (GIM) and the grating erasure mode (GEM) since these involve temporal
sequencing, and the issue of timing is crucial to the optimization of their
performance. The temporal analysis is also of interest for the simultaneous
erasure/writing mode (SEWM) because it clarifies the duration and nature of the
transient writing period before a stable response is achieved, and also because
it leads to a very important reciprocity law between the incident light power
and the response time of the converter (assuming that the dark conductivity of
the photorefractive crystal can be neglected). A final and very important issue
that arises from the temporal response is an image-induced phase modulation
of the coherent grating’s charge profile that can degrade PICOC performance
in some coherent processing architectures.

The analysis reported here is restricted to small modulation regimes for
simplicity. The results are modified substantially when operating with large
modulation depths. In particular, the transient period increases significantly in
such a regime.

To introduce some of the key concepts, consider the temporal evolution
of the space-charge field in response to coherent grating illumination in the
absence of an incoherent image. Kukhtarev's study of this problem using first
order perturbation analysis [7.19] (in the absence of self-diffraction effects) has
shown that if the photorefractive crystal is illuminated with an intensity

$$I(x, t) = \begin{cases} 0 & \text{for } t < 0 \\ I_0 (1 + m_G \cos K_G x) & \text{for } t > 0 \end{cases} \qquad (7.32)$$

then the space-charge field component $E_1(t)$ has the form

$$E_1(t) = -\tfrac{1}{2} m_G [E_0 + i E_D(K_G)] D(K_G)^{-1} \{1 - \exp[-t/T(K_G)]\} \qquad (7.33)$$

in which $D(K_G)$, $E_D(K_G)$, and $E_q(K_G)$ have been defined by (7.18-20). The
time constant $T(K)$ is defined by

$$T(K) = T_0 \, \frac{1 - i[E_0 + i E_D(K)]/E_M(K)}{1 - i[E_0 + i E_D(K)]/E_q(K)} , \qquad (7.34)$$

in which $T_0$ is the dielectric relaxation time, defined by

$$T_0 = \frac{\varepsilon}{e \mu n_0} , \qquad (7.35)$$

and $n_0$ is the zeroth order estimate of the electron density given by

$$n_0 = S_G I_0 N_D \tau . \qquad (7.36)$$

In (7.36), $\tau$ is the free carrier lifetime, as given by

$$\tau = (\gamma_R N_D^+)^{-1} . \qquad (7.37)$$

A group of parameters occurs in (7.34) having the dimensions of an electric
field. This field parameter is assigned the symbol $E_M(K_G)$, and is defined by

$$E_M(K_G) = (K_G \mu \tau)^{-1} . \qquad (7.38)$$

The physical interpretation of this field parameter can be best understood by
considering the average transport length of the mobile charges. When the drift
contribution to the current density in (7.12) dominates over the diffusion
contribution, the mobile charges during their limited lifetime $\tau$ travel an
average distance $L = \mu \tau E$ while under the influence of an electric field $E$.
The drift transport length is equal to the grating period when the field strength
$E = 2\pi E_M$.

The space-charge field in (7.33) exponentially approaches its saturation
limit, generally with a complex time constant $T(K_G)$ which denotes oscillatory
as well as decaying behavior. At low spatial frequencies, the drift-induced
transport length is very short compared with the grating period (equivalent to
having a field $E_M$ large compared to the applied bias field $E_0$). In this case,
the decay predicted by (7.33) is governed essentially by the dielectric relaxation
time $T_0$ and exhibits negligible oscillatory behavior. This gives the fastest
possible response time. For higher spatial frequencies and high applied bias fields,
for which the drift-driven charge transport length greatly exceeds the grating
period, the response time increases substantially beyond the dielectric relaxation
time and in addition the response exhibits oscillatory behavior. An intuitive
interpretation of this phenomenon is that the finite transport length blurs the
charge pattern being transcribed, forcing a longer recording time to achieve
a given level of charge profile modulation. Furthermore, the blur pattern is
one-sided because the applied bias field forces the mobile charges always in one
direction, inducing a phase shift of the charge pattern being transcribed. In all
cases, the response time is inversely proportional to the incident light intensity,
so that doubling the incident intensity reduces the response time by a factor of
two.
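The two regimes described above can be illustrated numerically by evaluating
the ratio $T(K)/T_0 = \{1 - i[E_0 + iE_D]/E_M\} / \{1 - i[E_0 + iE_D]/E_q\}$
from (7.34). The sketch below is a hedged illustration, not the chapter's own
calculation: the field expressions $E_D = k_B T K/e$ and $E_q = eN/(\varepsilon K)$
are assumed standard forms, the relative permittivity of 56 and the
mobility-lifetime product of $10^{-7}$ cm$^2$/V are assumed order-of-magnitude
values, and the helper names are hypothetical.

```python
import math

KT_OVER_E_V = 0.02585              # thermal voltage near 300 K, volts
E_CHARGE = 1.602176634e-19         # elementary charge, C
EPS0_F_PER_CM = 8.8541878128e-14   # vacuum permittivity, F/cm


def characteristic_fields(cycles_per_mm, trap_density_cm3=1.0e16,
                          eps_r=56.0, mu_tau_cm2_per_v=1.0e-7):
    """Return (E_D, E_q, E_M) in V/cm; eps_r and mu*tau are assumed values."""
    k = 2.0 * math.pi * cycles_per_mm * 10.0   # wave number in rad/cm
    e_d = KT_OVER_E_V * k
    e_q = E_CHARGE * trap_density_cm3 / (eps_r * EPS0_F_PER_CM * k)
    e_m = 1.0 / (k * mu_tau_cm2_per_v)
    return e_d, e_q, e_m


def time_constant_ratio(e0, e_d, e_q, e_m):
    """Complex ratio T(K)/T_0 in the form of (7.34)."""
    drive = complex(e0, e_d)                   # E_0 + i E_D
    return (1.0 - 1j * drive / e_m) / (1.0 - 1j * drive / e_q)


def decay_and_oscillation(ratio):
    """Decay time and oscillation period of exp(-t/T), in units of T_0."""
    rate = 1.0 / ratio
    decay_time = 1.0 / rate.real
    period = math.inf if rate.imag == 0.0 else 2.0 * math.pi / abs(rate.imag)
    return decay_time, period


E0_V_PER_CM = 6000.0   # applied bias field of 6 kV/cm, as in the text

results = {}
for frequency in (1.0, 300.0):                 # cycles/mm
    fields = characteristic_fields(frequency)
    ratio = time_constant_ratio(E0_V_PER_CM, *fields)
    results[frequency] = decay_and_oscillation(ratio)
    decay, period = results[frequency]
    print(f"{frequency:5.0f} cycles/mm: decay {decay:.2f} T0, period {period:.0f} T0")
```

Under these assumptions the low-frequency grating decays in essentially one
dielectric relaxation time with an oscillation period far longer than the decay,
whereas the 300 cycles/mm grating responds an order of magnitude more slowly
and oscillates on a comparable time scale.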

By similar analysis, one finds that the erasure of the resulting space-charge
field by a spatially uniform incident light beam also exhibits an exponential
response with a time constant that is inversely proportional to the erasure
light intensity. In the absence of an applied bias field, this response is described
by a simple exponential function with no shift in the phase of the coherent
grating’s charge profile. In the presence of an applied bias field, the uniform
erasure light beam induces a drift of the charge pattern that was originally
recorded by the coherent grating beams, resulting in a phase modulation as
well as an amplitude modulation.

Let us now review the analysis of one particular version of the simultaneous
erasure/writing mode (SEWM) for the photorefractive incoherent-to-coherent
optical conversion (PICOC) process. Consider a crystal in which a coherent
grating of wave vector $K_G$ has been written and has reached steady state. At
time $t = 0$, an incoherent image grating with wave vector $K_S$ is turned on. The
intensity incident upon the crystal is then described by

$$I(x, t) = \begin{cases} I_0 (1 + m_G \cos K_G x) & \text{for } t < 0 \\ I_0 (1 + m_G \cos K_G x) + I_1 (1 + m_S \cos K_S x) & \text{for } t > 0 \end{cases} \qquad (7.39)$$

The temporal evolution of the various components of the space-charge field
can be solved by perturbation techniques, valid for small levels of the
modulation depth $m_G$, with explicit expressions for the lowest order terms as
presented in Appendix 7.A. In particular, the temporal response of the $E_{11}$
component has the general form

$$E_{11}(t) = M_0 + M_1 \exp[-t/T_1(K_G)] + M_2 \exp[-t/T_2(K_S)]
+ M_3 \exp[-t/T_3(K_G + K_S)] + M_4 \exp[-t/T_4(K_G, K_S)] , \qquad (7.40)$$

in which the $M_i$ are complex coefficients that depend upon the applied field,
material parameters, and the incident light intensities, with explicit expressions
as given in Appendix 7.A and in [7.22]. Time constants $T_1$, $T_2$, and $T_3$ are
given by (7.34) for spatial frequencies corresponding to $K_G$, $K_S$, and
$(K_G + K_S)$, respectively. The fourth time constant $T_4$ is given by

$$T_4(K_G, K_S) = [1/T(K_G) + 1/T(K_S)]^{-1} . \qquad (7.41)$$

The first term $M_1$ corresponds to the recording of the coherent grating wave
vector $K_G$ in the absence of the image profile. The second term $M_2$
corresponds to the recording of the image profile wave vector $K_S$ in the absence
of the coherent grating. The third term $M_3$ corresponds to the recording of
the intermodulation frequency $K_G + K_S$, and represents a resonant response
of the system of photorefractive equations. The fourth term $M_4$ corresponds
to the nonresonant response driven by the product of the coherent grating and
the incoherent recording processes, which arises through the nonlinearity of
the recording process. The steady state value of the $E_{11}$ field component is
governed by the term $M_0$.
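The structure of (7.40) and (7.41), a constant term plus four complex
exponentials, can be sketched as follows. All numerical values here are
hypothetical placeholders chosen only so that the field starts at zero and
settles to $M_0$; the actual coefficients are given in Appendix 7.A and [7.22]
and are not reproduced in this section.

```python
import cmath


def combined_time_constant(t_g, t_s):
    """Fourth time constant from (7.41): [1/T(K_G) + 1/T(K_S)]^-1."""
    return 1.0 / (1.0 / t_g + 1.0 / t_s)


def e11(t, m, time_constants):
    """Evaluate the form of (7.40): M_0 plus four decaying complex exponentials.

    m holds [M_0, M_1, M_2, M_3, M_4]; time_constants holds [T_1, ..., T_4].
    """
    transient = sum(m_k * cmath.exp(-t / t_k)
                    for m_k, t_k in zip(m[1:], time_constants))
    return m[0] + transient


# Hypothetical complex time constants (arbitrary units), for illustration only.
t_g = 5.0 + 9.0j     # stands in for T(K_G)
t_s = 1.0 + 0.3j     # stands in for T(K_S)
t_sum = 4.0 + 7.0j   # stands in for T(K_G + K_S)
time_constants = [t_g, t_s, t_sum, combined_time_constant(t_g, t_s)]

# Hypothetical coefficients: the transients sum to -M_0, so E_11(0) = 0.
m = [1.0 + 0.0j, -0.4, -0.1, -0.3, -0.2]

for t in (0.0, 5.0, 50.0, 1000.0):
    print(f"t = {t:7.1f}: |E_11| = {abs(e11(t, m, time_constants)):.4f}")
```

Because the fourth rate is the sum of the rates at $K_G$ and $K_S$, the $M_4$
transient always decays faster than either the $M_1$ or the $M_2$ transient.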

The longest response time to reach steady state is comparable in value
to the pure coherent grating response as discussed above in (7.34-38) and in
[7.19]. This overall response time $t_R$ obeys the following reciprocity law

$$t_R^{-1} = G \, (S_G I_0 + S_S I_1) \qquad (7.42)$$

in the absence of appreciable dark conductivity $\beta$ in the photorefractive
crystal, in which the proportionality constant $G$ involves only material
parameters and the applied bias field $E_0$. As a result, the rate at which new
information can be recorded is determined by the total available intensity
incident on the crystal.

Similar temporal response analyses have been performed for both the grating
inhibition mode (GIM) and the grating erasure mode (GEM) [7.22]. In the
GEM mode, the response time constant of the system is inversely proportional
to the incoherent erasing intensity $I_1$, rather than the sum of the incoherent
and coherent intensities. Conversely, in the GIM mode, the response time
constant of the system is inversely proportional to the coherent grating intensity
$I_0$.

Sample perturbation analysis solutions for GEM, GIM, and SEWM are
shown in Figs. 7.20, 7.21, and 7.22 respectively for a 1 mW/cm$^2$ average
intensity coherent grating beam at 515 nm wavelength with a grating spatial
frequency of 300 cycles/mm and a small modulation depth $m_G$, a 1 mW/cm$^2$
average intensity incoherent image-bearing beam at 488 nm wavelength with
an image spatial frequency of 10 cycles/mm and also with a small modulation
depth $m_S$, an applied bias field of 6 kV/cm, and with the material parameters
for bismuth silicon oxide as given in Table 7.1.

Figure 7.20 shows the GEM response for three diffracted light components:
the direct recording of the unmodulated coherent grating component $I_{10}$, the
direct recording of the incoherent image-bearing beam $I_{01}$, and the
intermodulation component $I_{11}$ (the image-modulated grating component). The
coherent grating has been recorded to saturation and then turned off before the time
interval shown in the figure. The time $t = 0$ is defined when the image-bearing
beam is turned on. Thus the coherent grating frequency component $I_{10}$ starts
at its saturated level and decays for times $t > 0$ because of erasure by the
image-bearing beam. The direct recording of the image-bearing beam $I_{01}$ grows
from an initial value of zero to its saturation level. Note that the response time
for the $I_{01}$ image component, with its much lower spatial frequency, is
significantly faster than that for the $I_{10}$ coherent grating component, with its
order of magnitude higher spatial frequency. The image-modulated grating component
$I_{11}$ exhibits a temporal response which is derived from a combination of the
$I_{10}$ and $I_{01}$ response, eventually evolving into a slow decay in time when
the incoherent image-bearing light beam erases the coherent grating. The $I_{11}$
intensity is eventually erased by the image-bearing beam for very long recording
times, so that the image-bearing light exposure time must be truncated.

Fig. 7.20. Temporal evolution for the grating erasure mode (GEM) as predicted by
the perturbation series analysis. Time $t = 0$ starts after the coherent grating
has been recorded and the incoherent image-bearing beam has just been turned on.
In general the $I_{11}$ diffraction component would be much smaller than either the
$I_{10}$ or the $I_{01}$ components in this and in the next two figures,
corresponding to small modulation depths $m_G$ and $m_S$, but for simplicity all
three curves are shown as if these modulation depths were unity.

Fig. 7.21. Temporal evolution for the grating inhibition mode (GIM) as predicted
by the perturbation series analysis. Time $t = 0$ starts after the incoherent
image-bearing beam has been recorded and the coherent grating beams have just been
turned on.

Figure 7.21 shows a similar set of temporal response curves for the grating
inhibition mode (GIM). The image-bearing light has been recorded to saturation,
then turned off before the time interval shown in this figure. Time $t = 0$
is defined when the coherent grating light is turned on. In this mode, the pure
coherent grating component $I_{10}$ starts with zero intensity and gradually grows
to saturation, whereas the directly recorded incoherent image-bearing beam
$I_{01}$ starts from its saturation level and is quite rapidly erased by the uniform
component of the coherent grating light. The image-modulated grating component
$I_{11}$ shows initially a very rapid rise, followed by a much slower erasure by
the coherent grating beam, eventually decaying to zero for very long recording
times. In GIM, the image-modulated grating $I_{11}$ component generally falls far
short of its levels for GEM and SEWM, at least as predicted by the perturbation
series analysis, because the direct image recording is erased before the
coherent grating recording has a chance to grow.

Fig. 7.22. Temporal evolution for one version of the simultaneous erasure/writing
mode (SEWM) as predicted by the perturbation series analysis. Time $t = 0$ starts
after the coherent grating has been recorded and the incoherent image-bearing beam
has just been turned on.

Fig. 7.23. Measured diffraction efficiency of the $I_{11}$ beam (strongly modulated
erasure) as a function of time for two values of the signal intensity for the
simultaneous erasure/writing mode, shown for comparison with the corresponding
theoretically predicted response curves from the perturbation series analysis. In
this figure, $I_S$ is the intensity of the signal beam(s), $I_G$ is the intensity
of the grating recording beams, and $I_R$ is the intensity of the readout beam.

Figure 7.22 presents the simultaneous erasure/writing mode response,
assuming the temporal sequencing given by (7.39). The pure coherent grating
recording $I_{10}$ starts with its initial saturation level at time $t = 0$, and
slowly drops to a reduced saturation level. The direct recording of the incoherent
image-bearing beam $I_{01}$ rapidly builds from zero intensity to saturation. The
image-modulated grating component $I_{11}$ exhibits strong oscillations,
eventually settling at its saturation level, with a response time much longer than
that of the $I_{01}$ component.

Measurements of the temporal response of the $I_{11}$ diffraction order are
compared in Fig. 7.23 with the temporal response solutions generated by the
perturbation series analysis for the cases of intensity ratios $R = 1.5$ and
$R = 5.0$ and for a grating written in the nondegenerate simultaneous
erasure/writing mode (SEWM). The experiments are shown as solid lines, the
theoretical predictions as dashed lines. In this figure, the coherent grating is
established in the saturation regime at time $t = 0$, at which point the incoherent
erasure beam is allowed to expose the crystal, as given by (7.39). For a small $R$
ratio ($I_1 = I_S = 0.6$ mW/cm$^2$ in Fig. 7.23), the experimental diffraction
efficiency increases monotonically, at least within the time interval of this
figure. For a strong incoherent beam ($I_1 = I_S = 2$ mW/cm$^2$ in Fig. 7.23), a
transient effect appears in the experimental response within this same time
interval. Initially, the incident beam diffracts from the composite grating at wave
vector $(K_G + K_S)$ to generate a rapid rise in the amplitude of $I_{11}$, but the
strong incoherent illumination eventually erases the coherent grating and hence the
diffraction efficiency decreases to a small steady-state value.

The theoretical curves are scaled in peak intensity and in dielectric
relaxation time to achieve a reasonable match with the experiments, but once the
scale is defined for one curve, it fixes the scale for both theoretical curves. The
dielectric relaxation time needed to achieve agreement between the theoretical
curves and the experimental data is a factor of 3 slower than that derived from
the bismuth silicon oxide parameters given in Table 7.1 per (7.35-37). When
comparing the theoretical with the experimental curves in Fig. 7.23, it should
be noted that the theory is most accurate for low modulation depths, whereas
the experiment is performed with the highest possible modulation depths for
both the coherent grating and the incoherent image beams. Even so, the match
between the theoretical response and the measured response is quite striking.

The time constant for this particular set of experimental parameters is
in the range 0.5 to 1.5 s. To achieve conversion rates of 30 frames per second
in bismuth silicon oxide, a total light intensity of order 35 to 45 mW/cm$^2$ is
extrapolated, based upon the reciprocity law given in (7.42), and assuming
that the ratio of incoherent image-bearing light to coherent grating light is
kept constant.
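The extrapolation above can be reproduced with simple reciprocity scaling, in
which the product of response time and total intensity is held constant at a
fixed intensity ratio. The sketch below makes two assumptions that are not
stated explicitly in the text: that the coherent grating intensity follows from
$I_0 = I_1 / R$ with the Fig. 7.23 values, and that the 1.5 s time constant
belongs to the weaker exposure and the 0.5 s time constant to the stronger one.

```python
def required_intensity(measured_time_s, measured_intensity, target_time_s):
    """Reciprocity scaling: response time x total intensity stays constant
    (dark conductivity neglected, intensity ratio held fixed)."""
    return measured_intensity * measured_time_s / target_time_s


frame_time_s = 1.0 / 30.0   # 30 frames per second

# Fig. 7.23 cases: image-bearing intensity I_1 (mW/cm^2) and ratio R = I_1 / I_0.
weak_i1, weak_ratio = 0.6, 1.5
strong_i1, strong_ratio = 2.0, 5.0

# Assumption: total intensity is I_0 + I_1 with I_0 inferred from R.
weak_total = weak_i1 + weak_i1 / weak_ratio
strong_total = strong_i1 + strong_i1 / strong_ratio

# Assumption: 1.5 s pairs with the weak exposure, 0.5 s with the strong one.
needed_from_weak = required_intensity(1.5, weak_total, frame_time_s)
needed_from_strong = required_intensity(0.5, strong_total, frame_time_s)

print(f"total intensities: {weak_total:.1f} and {strong_total:.1f} mW/cm^2")
print(f"needed for 30 frames/s: {needed_from_strong:.0f} to "
      f"{needed_from_weak:.0f} mW/cm^2")
```

Under these assumptions the two cases bracket the range of roughly 35 to
45 mW/cm$^2$ quoted in the text.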


7.5  The  Readout  Process 

The  readout  process  consists  of  the  optical  modulation  of  the  coherent  readout 
beam  by  the  space-charge  field.  This  modulation  occurs  in  PICOC  through  the 
linear electrooptic effect which modifies the refractive index within the
photorefractive crystal, thus establishing a volume phase grating. The phase hologram
is  then  read  out  by  a  coherent  auxiliary  beam  to  achieve  the  conversion. 

The diffraction characteristics of such volume phase gratings have been
studied using many different analytical techniques, including coupled wave
analysis by Kogelnik [7.31] with extensive numerical studies by Klein and Cook
[7.32], a Born approximation to a scattering integral by Gordon [7.33], and
the optical beam propagation method by Yevick and Thylén [7.34] and Johnson
and Tanguay [7.35]. Methods for studying the polarization properties of
light diffraction in electrooptic crystals such as bismuth silicon oxide include
anisotropic versions of the coupled wave formalism [7.23,36] and the optical
beam propagation method [7.35].

This section examines the readout of the phase gratings and its
consequences for the performance of PICOC as a spatial light modulator. Because of
the high spatial frequencies typically used in PICOC, the grating exhibits
pronounced Bragg diffraction characteristics with rapid degradation of the
readout quality whenever the Bragg condition is detuned, whether by increasing
the spatial frequency of the incoherent image, by slight angular misalignment,
or by sub-optimum alignment of the incoherent image beam. The Bragg
sensitivity is discussed in the section on the isotropic phase grating properties. In
addition, because of the optical activity exhibited by bismuth silicon oxide, the
polarization states of both the transmitted probe light and the diffracted signal
light will in general be elliptical, with implications for the optimum readout
conditions, as discussed in the section on polarization properties.


7.5.1  Isotropic  Phase  Grating  Model 

The space-charge field induces a small perturbation in the index of refraction
through the linear electrooptic effect, such that

$$\Delta n(x) = \tfrac{1}{2}\, n_0^3\, r_{41}\, E_1 \cos(K_G x) \tag{7.43}$$

in which $n_0$ is the nominal index of refraction and $r_{41}$ is the
electrooptic coefficient appropriate for bismuth silicon oxide. The index
perturbation in turn modulates the optical phase fronts of an incident light
beam. Thus the sinusoidal space-charge field induces a volume phase grating in
the crystal. To gain some feeling for the readout performance issues involved
in PICOC, consider a 2 mm thick piece of bismuth silicon oxide with a simple
unmodulated sinusoidal grating with spatial frequency of 300 cycles/mm,
space-charge field of 5 kV/cm, and probed by a 633 nm laser beam. A
space-charge field $E_1$ of order 5 kV/cm induces an index perturbation
$\Delta n$ of order $2 \times 10^{-5}$, assuming the material parameters for
bismuth silicon oxide listed in Table 7.1.
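
As a rough numerical check of (7.43), the sketch below evaluates the peak index perturbation for a 5 kV/cm space-charge field. Table 7.1 is not reproduced in this section, so the values of `N0` and `R41` are assumptions (typical figures for bismuth silicon oxide near 633 nm), and the function name is purely illustrative.

```python
def index_perturbation(n0, r41, e_field):
    """Peak index change of (7.43): 0.5 * n0**3 * r41 * E1 (SI units)."""
    return 0.5 * n0**3 * r41 * e_field

N0 = 2.54       # assumed refractive index of BSO near 633 nm
R41 = 5.0e-12   # assumed electrooptic coefficient in m/V
E1 = 5.0e5      # 5 kV/cm expressed in V/m

delta_n = index_perturbation(N0, R41, E1)
print(f"delta_n = {delta_n:.2e}")
```

With these assumed parameters the result is close to the value of order $2 \times 10^{-5}$ quoted in the text.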

Dimensionless parameters that characterize the diffraction characteristics
of such a phase grating include the grating thickness parameter $Q$ and the
grating strength $v$, defined by

$$Q = \frac{K_G^2\, \lambda_R\, d}{2\pi n_0} \quad \text{and} \tag{7.44}$$

$$v = \frac{2\pi\, \Delta n\, d}{\lambda_R} \tag{7.45}$$


in which $\lambda_R$ is the wavelength of the readout light in vacuo and $d$ is
the thickness of the grating [7.32]. For a 300 cycles/mm grating in a 2 mm thick
crystal, read out by a 633 nm laser beam, the associated thickness parameter $Q$
is almost 300, which is considered to be a very thick grating [7.30]. A
space-charge field of the order of 5 kV/cm induces a grating strength of the
order of 0.35
radians,  which  gives  fairly  weak  diffraction  efficiency  (on  the  order  of  a  few 
per cent). The combination of large thickness parameter $Q$ and weak grating
strength $v$ places the grating deep in the Bragg regime with an optical
diffraction efficiency $\eta$ given approximately by

$$\eta = \left[\frac{v}{2\sigma}\,\sin\sigma\right]^2 , \quad \text{with} \tag{7.46}$$

$$\sigma = \left(\frac{v^2}{4} + \xi^2\right)^{1/2} \quad \text{and} \tag{7.47}$$

$$\xi = \tfrac{1}{2}\, K_G\, d\, \sin(\theta_{in} - \theta_B) \tag{7.48}$$

(according to [Ref. 7.31, Eqs. 17, 42, and 43] or [Ref. 7.32, Eqs. 6-8, 35, and
36]), in which $\theta_{in}$ is the entrance angle of the incident probe light
measured with respect to the constant phase lines of the coherent grating, and
$\theta_B(K_G)$ is the Bragg angle associated with wave vector $K_G$, defined by

$$\sin\theta_B = \frac{\lambda_R K_G}{4\pi n_0} . \tag{7.49}$$

The parameter $\xi$ in (7.47,48) is a measure of the Bragg misalignment; it is
equal to zero when the readout beam is perfectly Bragg aligned with respect
to the volume hologram.

For perfect Bragg alignment in which $\theta_{in} = \theta_B$ and assuming a
grating strength $v = 0.35$ radians, the peak diffraction efficiency is
estimated to be of order 3%. Doubling the thickness of the grating would
increase the diffraction efficiency to 12% (ignoring polarization issues), but
also increases the Bragg detuning sensitivity, as described in the following
paragraphs.
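
The numbers quoted above can be reproduced with a short sketch. It assumes the Klein and Cook form of the thickness parameter, $Q = 2\pi\lambda_R d/(n_0\Lambda_G^2)$, the Bragg-regime efficiency of (7.46,47), and a refractive index of 2.54; the index value and all function names are assumptions made for illustration only.

```python
import math

def thickness_parameter(wavelength, thickness, n0, fringe_spacing):
    """Q = 2*pi*lambda*d / (n0 * Lambda**2), equivalent to K**2*lambda*d/(2*pi*n0)."""
    return 2.0 * math.pi * wavelength * thickness / (n0 * fringe_spacing**2)

def bragg_efficiency(v, xi=0.0):
    """Efficiency [(v / 2 sigma) sin(sigma)]**2 with sigma**2 = (v/2)**2 + xi**2."""
    sigma = math.hypot(v / 2.0, xi)
    if sigma == 0.0:
        return 0.0  # no grating and no detuning: nothing is diffracted
    return (v / (2.0 * sigma) * math.sin(sigma)) ** 2

WAVELENGTH = 633e-9           # readout wavelength in m
THICKNESS = 2e-3              # crystal thickness in m
N0 = 2.54                     # assumed refractive index of BSO
FRINGE_SPACING = 1e-3 / 300   # 300 cycles/mm grating, spacing in m

q = thickness_parameter(WAVELENGTH, THICKNESS, N0, FRINGE_SPACING)
eta_2mm = bragg_efficiency(0.35)   # v = 0.35 rad at Bragg alignment
eta_4mm = bragg_efficiency(0.70)   # doubling d doubles v
print(f"Q = {q:.0f}, eta(2 mm) = {eta_2mm:.3f}, eta(4 mm) = {eta_4mm:.3f}")
```

Under these assumptions $Q$ comes out somewhat below 300, and the two efficiencies are near 3% and 12%, in line with the estimates in the text.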

Bragg  detuning  impacts  PICOC  performance  in  two  ways:  the  angular 
alignment  sensitivity  of  the  hologram  to  the  coherent  readout  beam,  and  the 
spatial  frequency  response  of  the  hologram  readout  with  its  concomitant  angu¬ 
lar  alignment  dependence  on  the  incoherent  image-bearing  beam.  Consider  first 
the alignment sensitivity to the coherent readout beam. An angular
misalignment $\Delta\theta$ from the optimum Bragg alignment introduces a Bragg
mismatch $\xi$ of

$$\xi \approx \tfrac{1}{2}\, K_G\, d\, \Delta\theta , \tag{7.50}$$

as seen from (7.48). Assuming a grating strength $v = 0.35$ radians, one finds
from (7.46,47) that the angular alignment sensitivity of the diffraction
efficiency has essentially a $\mathrm{sinc}^2$ profile. The angular
misalignment $\Delta\theta$ needed to reach the first null of this profile
occurs when $\xi = \pi$, i.e., for

$$\Delta\theta = \frac{2\pi}{K_G d} . \tag{7.51}$$

Hence  the  angular  alignment  needed  to  achieve  optimum  diffraction  efficiency 
is  extremely  sensitive,  on  the  order  of  0.1°  for  a  2  mm  thick  photorefractive 
crystal  with  a  300  cycles/mm  grating  frequency. 
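
The 0.1° figure follows from setting the Bragg mismatch equal to $\pi$ in the small-angle relation $\xi \approx \tfrac{1}{2} K_G d\, \Delta\theta$, which gives $\Delta\theta = 2\pi/(K_G d)$, i.e. the fringe spacing divided by the crystal thickness. A minimal sketch, with an illustrative function name:

```python
import math

def first_null_misalignment_deg(fringe_spacing, thickness):
    """Angular detuning (degrees) at which xi = pi: 2*pi/(K*d) = Lambda/d."""
    return math.degrees(fringe_spacing / thickness)

FRINGE_SPACING = 1e-3 / 300   # 300 cycles/mm grating, spacing in m
THICKNESS = 2e-3              # crystal thickness in m

tolerance_deg = first_null_misalignment_deg(FRINGE_SPACING, THICKNESS)
print(f"first null at {tolerance_deg:.3f} degrees")
```

Because the tolerance scales as $1/d$, doubling the crystal thickness halves the allowable angular error.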

One  possible  alignment  of  the  PICOC  system  is  to  orient  the  readout  beam 
to  be  Bragg  matched  precisely  to  the  coherent  grating,  thereby  maximizing  the 
intensity of the $I_{10}$ diffracted order. However, when an incoherent grating
is also incident upon the crystal, the nonlinear recording process creates a
new grating with wave vector ($K_G + K_S$) that in general does not satisfy the
same Bragg condition as the coherent grating wave vector $K_G$. The resulting
$I_{11}$ intensity component is then attenuated by an amount dependent upon the
image wave vector $K_S = 2\pi f_S$, in which $f_S$ is the spatial frequency of
the image profile. The dependence of the attenuation on the image frequency
$f_S$ is a function of the orientation of the image-bearing beam with respect
to the coherent grating.

The importance of the image-bearing beam orientation on the spatial frequency
response of the converter is strikingly illustrated by Fig. 7.24, in which
are shown converted images of two orthogonal orientations of a [?] line
pair/mm Ronchi ruling and the associated coherent Fourier transforms. As can
be seen from the figure, a significant difference in resolution exists between
cases in which the wave vector of the ruling (the incoherent grating) is
parallel or perpendicular to the coherently written (holographic) grating.
This difference derives principally from the fact that a different wave vector
matching condition exists for these two cases.

Consider the alternative wave vector matching conditions shown in Figs.
7.25 and 7.26. In Fig. 7.25, the incoherent grating wave vector $K_S$ is
parallel to the coherent grating wave vector $K_G$, a condition achieved by
symmetrically
disposing  the  incident  coherent  beams  about  the  normal  to  the  crystal  while 
simultaneously  arranging  the  incoherent  imaging  system  such  that  its  optical 
axis  is  parallel  to  the  crystal  normal.  In  this  case,  significant  Bragg  detuning 
occurs  for  even  small  incoherent  grating  wave  vectors. 


Fig. 7.24. Photograph of the Fourier transform of a Ronchi ruling recorded in
the PICOC configuration, showing a strong anisotropy in the resolution
performance when the image-bearing light beam bisects the two coherent grating
writing beams

Fig. 7.25. Wave vector mismatch diagram for the case in which the
image-bearing beam bisects the two coherent grating writing beams


Let the spatial bandwidth of the readout process be defined as that spatial
frequency $f_S$ for which the magnitude of the $I_{11}$ diffracted intensity
component degrades to 25% of its peak value. This occurs when $\xi = 1.9$, as
determined from (7.46,47), assuming a small grating strength $v$ on the order
of 0.35 radians. The Bragg mismatch parameter $\xi$ is a function of the image
spatial frequency $f_S$, and can be approximated by the first term or two in a
Taylor series expansion of the form

$$\xi(f_S) = \xi_0 + f_S \frac{d\xi}{df_S} + \tfrac{1}{2} f_S^2 \frac{d^2\xi}{df_S^2} + \text{higher order terms} \tag{7.52}$$

in which $\xi_0$ represents the Bragg mismatch associated with the coherent
grating. Standard alignment procedure is to set this zeroth order mismatch
term $\xi_0$ to zero, i.e., to Bragg match the incident readout beam to the
coherent grating.

For the case of the image-bearing beam bisecting the two coherent grating
beams, as shown in Fig. 7.25, the first order term in (7.52) is a sufficiently
accurate estimate of the Bragg mismatch parameter and this term can be
derived from (7.48) to give

$$\frac{d\xi}{df_S} = -\tfrac{1}{2}\, K_G\, d\, \frac{d\theta_B}{df_S} . \tag{7.53}$$

By (7.49), the term $d\theta_B/df_S$ is equivalent to

$$\frac{d\theta_B}{df_S} = \frac{\lambda_R}{2 n_0 \cos\theta_B} . \tag{7.54}$$

Hence the spatial bandwidth $f_S$ is

$$f_S = \frac{7.6\, n_0 \cos\theta_B}{\lambda_R K_G d} , \tag{7.55}$$

which can be expressed in terms of the fringe spacing $\Lambda_G$ of the
coherent grating as

$$f_S = \frac{1.2\, n_0 \Lambda_G \cos\theta_B}{\lambda_R d} . \tag{7.56}$$

Fig. 7.26. Optimum wave vector alignment diagram in which the image-bearing
light is tilted with respect to the bisector of the two coherent grating
writing beams by the Bragg angle associated with the probe beam


For  the  parameters  used  in  our  experiments,  this  bandwidth  is  estimated  to 
be  on  the  order  of  8  cycles/mm.  Note  that  doubling  the  crystal  thickness  d 
to  increase  the  diffraction  efficiency  by  almost  a  factor  of  four  has  the  adverse 
effect of reducing the spatial bandwidth $f_S$ by a factor of two for this particular
alignment  configuration. 
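
The bandwidth estimate for the bisecting geometry can be checked numerically. The sketch below first locates the mismatch at which a $\mathrm{sinc}^2$ profile falls to 25% of its peak, then converts it to a spatial frequency through the first order Taylor term, $f_S = \xi / |d\xi/df_S|$ with $|d\xi/df_S| = K_G d \lambda_R / (4 n_0 \cos\theta_B)$. The refractive index of 2.54 and the function names are assumptions for illustration.

```python
import math

def sinc_squared(xi):
    """Weak-grating Bragg detuning profile (sin(xi)/xi)**2."""
    return 1.0 if xi == 0.0 else (math.sin(xi) / xi) ** 2

def mismatch_at_fraction(fraction):
    """Bisection for xi in (0, pi) where the main lobe drops to `fraction`."""
    lo, hi = 0.0, math.pi
    for _ in range(60):
        mid = 0.5 * (lo + hi)
        if sinc_squared(mid) > fraction:
            lo = mid   # still above the target level: move right
        else:
            hi = mid
    return 0.5 * (lo + hi)

def bisector_bandwidth(xi_edge, n0, fringe_spacing, wavelength, thickness):
    """f_S = 4*xi*n0*cos(theta_B) / (lambda*K*d), in cycles per metre."""
    sin_b = wavelength / (2.0 * n0 * fringe_spacing)  # Bragg angle in the crystal
    cos_b = math.sqrt(1.0 - sin_b**2)
    k_g = 2.0 * math.pi / fringe_spacing
    return 4.0 * xi_edge * n0 * cos_b / (wavelength * k_g * thickness)

xi_25 = mismatch_at_fraction(0.25)
bw = bisector_bandwidth(xi_25, 2.54, 1e-3 / 300, 633e-9, 2e-3)
print(f"xi at 25% = {xi_25:.2f}, bandwidth = {bw / 1e3:.1f} cycles/mm")
```

The mismatch comes out near 1.9 and the bandwidth near 8 cycles/mm, and halving or doubling `thickness` scales the bandwidth inversely, as stated in the text.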

Compare now the resolution performance associated with the alignment of
Fig. 7.25 with that for Fig. 7.26. In Fig. 7.26, the incoherent grating wave
vector is arranged to lie tangentially to the circle defined by the readout
beam wave vector, such that a significantly increased angular deviation of the
diffracted beam is allowed before serious Bragg detuning effects occur. Such a
wave vector tangency condition is automatically satisfied when the incoherent
image wave vector is normal to the coherent grating wave vector (as it is in
the y orientation normal to the plane of incidence). Alternatively, the wave
vector tangency condition is satisfied when the central ray of the incoherent
image beam is parallel to the diffracted probe light, for which the horizontal
and vertical resolutions become degenerate. This is not the case for the
situation depicted in Fig. 7.25, which explains the observations apparent in
Fig. 7.24. An equivalent condition has been described by Kamshilin and Petrov
[7.6] for the nondegenerate four-wave mixing optical architecture.

In the geometry of Fig. 7.26, the addition of an image frequency $f_S$ shifts
the pointing direction of the combined wave vector ($K_G + K_S$) such that the
incident readout light remains almost perfectly Bragg matched over a much
broader range of image frequencies. Mathematically, this condition is equivalent
to setting the first order term $d\xi/df_S$ of the Taylor series expansion in
(7.52) for the Bragg mismatch parameter $\xi$ equal to zero. To demonstrate
this, consider the more general case in which the image profile wave vector
$K_S$ is oriented at a small angle $\theta_S$ with respect to the coherent
grating wave vector $K_G$. The combination ($K_G + K_S$) rotates through a small
angle $\Delta\theta$ compared with wave vector $K_G$ alone, with $\Delta\theta$
given approximately by


The Bragg mismatch parameter $\xi$ induced by the image wave vector $K_S$ is
to first order given by


« =  ih* + 


(7.58) 


and these two terms cancel when the skew angle $\theta_S = \theta_B$, the Bragg angle for
the  readout  light  beam  associated  with  the  coherent  grating  frequency.  For  a 
633  nm  readout  beam  wavelength  and  a  300  cycles/mm  grating  frequency,  the 
Bragg  angle  is  of  order  5°,  implying  that  a  mere  5°  separates  the  optimum 
resolution  alignment  of  Fig.  7.26  from  the  alignment  of  Fig.  7.25.  In  addition, 
the  angular  alignment  tolerance  on  the  incoherent  image-bearing  beam  is  much 
tighter  than  the  nominal  alignment  angle  of  5°,  typically  of  order  0.7°. 
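
For orientation, the Bragg angle for these parameters can be evaluated directly. The sketch below computes it both in air and inside the crystal; the quoted value of order 5° appears to correspond to the in-air form, while the internal angle is smaller by roughly the refractive index. The index of 2.54 is an assumption, and the function name is illustrative.

```python
import math

def bragg_angle_deg(wavelength, spatial_frequency, n0=1.0):
    """Bragg angle (degrees) from sin(theta_B) = lambda * f / (2 * n0)."""
    return math.degrees(math.asin(wavelength * spatial_frequency / (2.0 * n0)))

WAVELENGTH = 633e-9        # readout wavelength in m
GRATING_FREQ = 300e3       # 300 cycles/mm in cycles per metre

theta_air = bragg_angle_deg(WAVELENGTH, GRATING_FREQ)
theta_crystal = bragg_angle_deg(WAVELENGTH, GRATING_FREQ, n0=2.54)
print(f"in air: {theta_air:.2f} deg, inside crystal: {theta_crystal:.2f} deg")
```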

With the alignment of Fig. 7.26, the degradation in diffraction efficiency
then becomes a second order function of the image spatial frequency $f_S$. The
second derivative term $d^2\xi/df_S^2$ in (7.52) is

$$\frac{d^2\xi}{df_S^2} = \frac{\pi\, d\, \lambda_R}{n_0}$$

and  the  resulting  spatial  bandwidth  of  the  tangential  configuration  is 


(7.59) 


This expression was first published by Kamshilin and Petrov [7.6]. For our
experimental parameters, a tangential geometry increases the converter's
bandwidth from 8 to 48 cycles/mm. It is interesting to note that, in addition
to the increased bandwidth of the tangential geometry, the spatial resolution
is independent of the coherent grating frequency, and that doubling the
thickness of the crystal $d$ does not halve the converter's bandwidth, as it
would for the alignment configuration shown in Fig. 7.25, but only reduces it
by a factor of $\sqrt{2}$.
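
The scaling described above can be illustrated with a short sketch. It assumes that the second derivative of the Bragg mismatch is $\pi d \lambda_R / n_0$, that the first order term vanishes, and that the 25% criterion $\xi = 1.9$ still defines the bandwidth, so that $\tfrac{1}{2} f_S^2\, d^2\xi/df_S^2 = 1.9$. This is a reconstruction from the surrounding definitions rather than the published expression, and the index of 2.54 is an assumed value.

```python
import math

def tangential_bandwidth(xi_edge, n0, wavelength, thickness):
    """Solve 0.5 * f**2 * (pi*d*lambda/n0) = xi_edge for f (cycles per metre)."""
    second_derivative = math.pi * thickness * wavelength / n0
    return math.sqrt(2.0 * xi_edge / second_derivative)

bw_2mm = tangential_bandwidth(1.9, 2.54, 633e-9, 2e-3)
bw_4mm = tangential_bandwidth(1.9, 2.54, 633e-9, 4e-3)
print(f"2 mm: {bw_2mm / 1e3:.0f} cycles/mm, 4 mm: {bw_4mm / 1e3:.0f} cycles/mm")
```

Under these assumptions the estimate lands within a few per cent of the 48 cycles/mm quoted in the text, contains no dependence on the coherent grating frequency, and falls by $\sqrt{2}$ when the thickness is doubled.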


Further experimental tests of the Bragg detuning hypotheses are presented
in Figs. 7.27-29, in which the image source was a Michelson interferometer
(shown in Fig. 7.15) to alleviate the depth of focus issues discussed previously.
In these experiments, the intensity of the diffracted component $I_{11}$ was
measured as a function of the spatial frequency of the image source grating.
Figure 7.27 shows the response for the geometry in which the incoherent image
beam bisects the coherent grating writing beams to achieve the wave vector
mismatch condition diagrammed in Fig. 7.25. The predicted frequency response
rolloff with bandwidth of 8 cycles/mm, shown by the solid line, compares
reasonably well with the experimental data points, which indicate a bandwidth
of 6 cycles/mm. Figure 7.28 shows the rolloff when the signal beam wave vector
is tangent to the equiphase circle as shown in Fig. 7.26, giving a much improved
frequency response of 45 cycles/mm, which is in excellent agreement with the
theoretically predicted 48 cycles/mm. Figure 7.29 shows the rolloff when the
signal light wave vector is nominally aligned to be perpendicular to the applied
bias field and to the coherent grating wave vector. The theoretical bandwidth
for this configuration should be identical to that shown in Fig. 7.28, but the
measurements indicate a bandwidth of only 25 cycles/mm. This discrepancy
may be attributable to the very high alignment sensitivity of the incoherent
image beam relative to the plane defined by the two coherent writing beams,
e.g., a deviation of approximately 0.7° would cause a deterioration of the
bandwidth from 48 cycles/mm to 25 cycles/mm.

Fig. 7.27. Experimental measurement of the diffraction efficiency as a function
of image spatial frequency associated with the alignment shown in Fig. 7.25

Fig. 7.28. Experimental measurement of the diffraction efficiency as a function
of image spatial frequency associated with the optimum alignment shown in
Fig. 7.26

Fig. 7.29. Experimental measurement of the diffraction efficiency as a function
of image spatial frequency for the alignment in which the image-bearing beam's
wave vector is normal to the applied bias electric field

To summarize, the presence of the coherent grating in PICOC defines a
volume  hologram  that  is  typically  operated  quite  deep  into  the  Bragg  regime, 
with  very  strong  misalignment  sensitivities.  As  a  consequence,  the  readout 
beam  must  be  aligned  typically  within  0.1°  of  optimum,  the  image-bearing 
beam  must  be  aligned  typically  within  0.7°  of  its  optimum,  and  the  optimum 
alignment  for  the  image-bearing  beam  is  not  to  bisect  the  two  coherent  grating 
beams  (Fig.  7.25),  but  rather  offset  from  this  by  a  small  angle  typically  of  order 
5°  (as  shown  in  Fig.  7.26). 

7.5.2  Polarization  Issues 

Bismuth silicon oxide has quite remarkable optical polarization properties that
strongly influence the optimization of the readout process [7.23]. These
properties include significant levels of natural optical activity (as high as
46°/mm for the 488 nm argon ion laser wavelength; see Fig. 7.30), a uniform
linear birefringence induced by the applied bias field through the electrooptic
effect, and a spatially varying linear birefringence induced by the
image-defined space-charge field. In almost all readout configurations, these
properties will cause both the readout and the diffracted signal beams to
exhibit elliptical polarizations.

Consider Fig. 7.31, in which is shown the evolution of the polarization
states for both the readout and the diffracted signal beams as a function of
depth into the bismuth silicon oxide crystal for a 633 nm wavelength readout
beam and an applied bias field of 6 kV/cm in the absence of self-diffraction
effects. A hologram induced by a single coherent grating unmodified by any
image profile is assumed. In this figure, we have chosen to plot the polarization
states that result from an input polarization set at 45° with respect to the
grating wave vector, which is therefore along one of the two electrooptically
induced principal axes of the crystal's index ellipsoid. As such, the
polarizations of the readout and signal beams are nearly parallel for very
shallow depths but quickly evolve toward a 90° major axis separation with
increasing depth because of the influence of the optical activity. This
separation of polarization states enables the use of polarization analyzer
techniques to suppress the scattered readout beam light in favor of the
diffracted signal light. The intricacy of the polarization state evolution can
be appreciated from Fig. 7.31.

Fig. 7.30. Optical rotatory power of bismuth silicon oxide as a function of the
wavelength of the readout light (after [7.27])

One potential application of the polarization properties in bismuth silicon
oxide has been demonstrated by Herriau et al. [7.15] for obtaining optimum
holographic readout. They attained excellent suppression of scattered readout
light noise for a nearly on-axis recording configuration by placing a polarization
analyzer into the diffracted signal beam path. The analyzer is then adjusted
to eliminate the scattered readout beam, and since the diffracted signal beam
generally has a different polarization state from that of the transmitted readout
beam, a significant fraction of the signal beam will pass through the analyzer.
This technique greatly improves the signal-to-noise ratio of the holographic
reconstruction process.

The presence of spatial modulation further complicates the polarization
properties of the diffracted light, especially when the incoherent image beam
is misaligned from the optimum wave vector matching configuration shown in
Fig. 7.26, which introduces a strong polarization state dispersion. That is, the
polarization states of the modulated readout beam's diffraction orders ($I_{mn}$
in Fig. 7.10) are strongly dependent on the image spatial frequency when the
optimum alignment of Fig. 7.26 is not achieved. Such dispersion further
degrades the resolution if a polarization analyzer is used to separate the
scattered readout light from the diffracted signal light. In contrast, when the
optimum alignment of Fig. 7.26 is met, the polarization dispersion is
negligible. This is one more reason why the alignment of Fig. 7.26 is crucial
to achieving the best performance from the PICOC device.

Fig. 7.31. Sample evolution of the polarization states of the incident readout
beam and the diffracted signal beam for a simple sinusoidal grating

The  most  serious  issue  concerning  the  polarization  properties  of  volume 
holograms  in  bismuth  silicon  oxide  is  the  degradation  of  the  light  diffraction 
efficiency  that  is  imposed  by  the  optical  rotatory  power  [7.23].  However,  one 
readout  configuration  has  been  identified  in  extensive  polarization  analyses 
that  does  not  degrade  the  diffraction  efficiency,  namely  the  crystal  geometry 
of  Fig.  7.8  with  no  applied  bias  field  and  with  a  circularly  polarized  readout 
light beam. The diffraction efficiency for a linearly polarized readout beam is
compared  with  that  of  a  circularly  polarized  readout  beam  in  Fig.  7.32,  showing 
the  marked  improvement  in  diffraction  efficiency  that  can  be  achieved  by  using 
circularly  polarized  light  at  the  correct  alignment. 

In  conclusion,  the  polarization  properties  of  light  diffraction  in  bismuth 
silicon  oxide  significantly  affect  the  performance  of  the  PICOC  modulator,  and 
can  be  exploited  to  improve  this  performance  in  terms  of  signal-to-noise  ratio 
and  to  attain  the  highest  possible  diffraction  efficiency. 


Fig. 7.32. Comparison of the diffraction efficiency achieved by linearly
polarized and circularly polarized probe light when the applied bias field is
set to zero during the readout process


7.6  Conclusion 

Through the use of the photorefractive incoherent-to-coherent optical
conversion (PICOC) process, we have successfully converted incoherent images
into their negative coherent replicas. The PICOC system is inexpensive, easily
implementable, and compares favorably in its performance with other
photorefractive spatial light modulators. In addition, we have developed a
mathematical framework within which to analyze the performance of the
converter. Using this formalism, we have derived the conditions necessary to
achieve high linearity, good contrast ratio, and fast temporal response.
Experiments have been conducted which verified the essential features of these
predictions, and which have underscored important materials issues. It is of
considerable importance to extend the model to account more accurately for
these materials issues, to predict the response of the converter at higher
modulation depths, and to explore the optical phase modulation induced in the
output image and its impact on various image processing architectures. In
addition, testing of the photorefractive incoherent-to-coherent optical
converter in a representative optical image processing system will undoubtedly
highlight significant features worthy of further study and advanced
development.

7.A Appendix. Steady State and Temporal Behavior
of the Space-Charge Field Components in PICOC
(Simultaneous Erasure/Writing Mode)

In this Appendix, the expressions for the lowest order components of the
space-charge field $E_{mn}$ are presented, both in the steady state and with
full temporal evolution. Details of the derivation of these expressions can be
found in [7.22].

7.A.1 Steady State Behavior

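If the light incident on the photorefractive crystal is given by

$$I(x) = I_0(1 + m_G \cos K_G x) + I_1(1 + m_S \cos K_S x) \tag{7.A1}$$

then to first order the steady state responses of the $E_{01}$, $E_{10}$, and
$E_{11}$ components are

$$E_{10} = -\tfrac{1}{2}\, m_G^{\rm eff}\, \frac{E_0 + iE_D(K_G)}{D(K_G)} \tag{7.A2}$$

$$E_{01} = -\tfrac{1}{2}\, m_S^{\rm eff}\, \frac{E_0 + iE_D(K_S)}{D(K_S)} \tag{7.A3}$$

$$E_{11} = \tfrac{1}{4}\, m_G^{\rm eff} m_S^{\rm eff}\, [D(K_G) D(K_S) D(K_G + K_S)]^{-1} \times \{F_1[E_0 + iE_D(K_G)] + F_2[E_0 + iE_D(K_S)]\} \tag{7.A4}$$

in which

$$m_G^{\rm eff} = m_G\, \frac{s_G I_0}{s_G I_0 + s_S I_1} \tag{7.A5}$$

$$m_S^{\rm eff} = m_S\, \frac{s_S I_1}{s_G I_0 + s_S I_1} \tag{7.A6}$$

$$D(K) = 1 - i\, \frac{E_0 + iE_D(K)}{E_q(K)} \tag{7.A7}$$

$$F_1 = D(K_G) + \frac{E_D(K_S)}{E_q(K_G)} \tag{7.A8}$$

$$F_2 = D(K_S) + \frac{E_D(K_G)}{E_q(K_S)} \tag{7.A9}$$

$$E_q(K) = \frac{e N_A}{\epsilon \epsilon_0 K} \tag{7.A10}$$

$$E_D(K) = \frac{k_B T K}{e} \tag{7.A11}$$

for which $K$ can assume the values $K_G$, $K_S$, and $(K_G + K_S)$.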
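
To make the role of the effective modulation depths concrete, the sketch below evaluates them for a given pair of beam intensities. It assumes the reading of (7.A5) and (7.A6) in which each modulation depth is weighted by its beam's share of the total photoexcitation, with $s_G$ and $s_S$ taken to be relative photoexcitation cross sections at the two writing wavelengths; that interpretation, the default values, and the function name are assumptions.

```python
def effective_modulation_depths(m_g, m_s, i0, i1, s_g=1.0, s_s=1.0):
    """Return (m_G_eff, m_S_eff), each weighted by its beam's excitation share."""
    total = s_g * i0 + s_s * i1
    if total <= 0.0:
        raise ValueError("total photoexcitation must be positive")
    return m_g * s_g * i0 / total, m_s * s_s * i1 / total

# Equal intensities and cross sections: each modulation depth is halved.
m_g_eff, m_s_eff = effective_modulation_depths(1.0, 1.0, i0=1.0, i1=1.0)
print(m_g_eff, m_s_eff)
```

In this picture, adding the incoherent image beam dilutes the coherent grating modulation, which is why the coherent grating component is partially erased as the image is written.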

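7.A.2 Temporal Response

The temporal behavior of the space-charge field components for the
simultaneous erasure/writing mode (SEWM) is discussed in this section. Consider
the following sequencing of light intensity profiles:

$$I(x) = \begin{cases} I_0(1 + m_G \cos K_G x) & \text{for } t < 0 \\ I_0(1 + m_G \cos K_G x) + I_1(1 + m_S \cos K_S x) & \text{for } t > 0 \end{cases} \tag{7.A12}$$

The temporal evolution of the various components of the space-charge field is
given by

$$E_{10} = -\tfrac{1}{2}\, \{(m_G - m_G^{\rm eff}) \exp[-t/\tau(K_G)] + m_G^{\rm eff}\}\, [E_0 + iE_D(K_G)]\, D^{-1}(K_G) \tag{7.A13}$$

$$E_{01} = -\tfrac{1}{2}\, m_S^{\rm eff}\, \{1 - \exp[-t/\tau(K_S)]\}\, [E_0 + iE_D(K_S)]\, D^{-1}(K_S) \tag{7.A14}$$

$$E_{11} = M_0 + M_1 e^{-t/\tau_1} + M_2 e^{-t/\tau_2} + M_3 e^{-t/\tau_3} + M_4 e^{-t/\tau_4} \tag{7.A15}$$

in which

$$M_0 = \tfrac{1}{4}\, m_G^{\rm eff} m_S^{\rm eff}\, [D(K_G) D(K_S) D(K_G + K_S)]^{-1} \times \{F_1[E_0 + iE_D(K_G)] + F_2[E_0 + iE_D(K_S)]\} \tag{7.A16}$$

$$M_1 = \tfrac{1}{4}\, (m_G - m_G^{\rm eff})\, m_S^{\rm eff}\, [D(K_G) D(K_S) D(K_G + K_S)]^{-1} \times \{F_1[E_0 + iE_D(K_G)] + F_2[E_0 + iE_D(K_S)](1 - \ldots)\} \tag{7.A17}$$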


,V/2  =  -  jm'*mf[D{KGW(KS)D{KG  +  Asj]-'~V 
4  12  ~T3 

x  |f1[£o  +  i£d(A'g)](^1  -  ^  +  E2[E0  ^.£d(A's)]|  (7.A1S) 

\u  =  -  i(mG  -mg)Tnf[D(KG)D[Ks)D(KG  +  AY)]-1— 

4  •'4  -  A 

x  |fi[£b  +  i£D(^G)](l-^) 

+  £2[£o+i£D(^s)](l-^)}  (7.A19) 

$$M_3 = -(M_0 + M_1 + M_2 + M_4) . \tag{7.A20}$$


The dielectric relaxation time constant $\tau_0$ is defined by


the time constants $\tau_1$, $\tau_2$, and $\tau_3$ are defined by
rQC(JO 

*  t'A  )  ni  is\ 


(7.A21) 


(7.  A  22) 


for $K = K_G$, $K_S$, and $(K_G + K_S)$ respectively; and the time constant
$\tau_4$ is defined by


T'~{r,+kV  ■ 

In  (7.A22),  the  factor  C(K)  is  defined  by 


(7.A23) 


(7.A24) 


Note that the GEM temporal response can be derived from the SEWM
expression by setting the average coherent grating intensity $I_0 = 0$ for
$t > 0$. Thus the $M_0$ and $M_2$ terms disappear for GEM, leaving the $M_1$,
$M_4$, and $M_3 = -(M_1 + M_4)$ terms. The GIM response can in turn be derived
from the GEM response.


Introduction

A number of inherent inefficiencies exist in the photorefractive recording of
grating structures due to the nature of the photoexcitation and charge
transport processes. These inefficiencies can be quantified by postulating
highly idealized photoexcitation and charge transport models that yield optimum
(quantum limited) space charge field distributions, assuming a photoexcitation
constraint of no more than one mobile charge per incident photon. By comparing
such highly idealized photorefractive recording models with more realistic
models, the fundamental origins of several such inherent inefficiency factors
can be identified and their magnitudes estimated. In this manner, the grating
recording efficiencies of photorefractive materials can be directly compared
with the fundamental physical limitations imposed by quantum constraints.

In this chapter, we will establish the absolute quantum efficiency of the
photorefractive grating recording process by deriving the optimum idealized
photorefractive recording model subject to such quantum constraints. A more
realistic photoexcitation and charge transport model applicable to numerous
currently investigated real time photorefractive materials will then be
examined in depth, with emphasis on a comparison with the characteristics of
the optimum quantum limited model. This realistic charge transport model,
based on the extensive previous work of numerous authors, is presented in
such a manner as to illustrate its statistical nature and to provide a physically
intuitive interpretation of its principal attributes.

Of all of the parameters that seek to quantify the absolute or relative
performance of photorefractive materials, one of the most important is the
photorefractive sensitivity (von der Linde and Glass, 1975; Micheron, 1978;
Glass, 1978; Günter, 1982; Yeh, 1987a and 1987b; Glass et al., 1987; Valley
and Klein, 1983). This key parameter is typically defined in theoretical
analyses either as the refractive index modulation obtained in writing a
uniform grating of fixed spatial frequency per unit absorbed recording energy
density (energy per unit volume) (von der Linde and Glass, 1975; Micheron,
1978; Glass, 1978; Günter, 1982; Glass et al., 1987; Valley and Klein, 1983),
or as the inverse of the recording energy density required to achieve a
specified value of the diffraction efficiency for a uniform grating of fixed
spatial frequency in a material of given thickness (Valley and Klein, 1983;
Huignard and Micheron, 1976). Alternatively, for purposes of experimental
measurement, the photorefractive sensitivity may be specified as the inverse of
the recording energy density required to reach a given fraction of the
saturation diffraction efficiency of a particular material (Günter, 1982;
Amodei and Staebler, 1972). The photorefractive sensitivity, and the
fundamental physical limitations that apply to it, are of current significant
interest because they establish the maximum reconfiguration rate of volume
holographic optical elements (VHOEs) (von der Linde and Glass, 1975; Tanguay,
1985) at constant average optical input power. The maximum reconfiguration rate
of VHOEs is in turn important for applications ranging from massively parallel
interconnections in optical processing and computing systems (Tanguay, 1985)
to the development of the photorefractive incoherent-to-coherent optical
converter (PICOC) (Kamshilin and Petrov, 1980; Shi et al., 1983; Marrakchi et
al., 1985).

A number of factors contribute to the various photorefractive sensitivities
characteristic of photoconductive, electrooptic materials. One such factor is
the photogeneration quantum efficiency, which represents the number of
photogenerated mobile charge carriers per photon absorbed from the recording
beam(s). A second factor is the charge transport efficiency, which is a
measure of the degree to which the average photogenerated mobile charge
carrier contributes to the forming space charge grating after separation from
its original site by means of drift and/or diffusion and subsequent trapping.
The magnitude of the space charge field generated by a given space charge
grating is inversely proportional to the dielectric permittivity $\epsilon$ of
the photorefractive material, which thus contributes a third factor to the
grating recording sensitivity. A fourth factor describes the perturbation of
the local index ellipsoid (dielectric tensor at optical frequencies) that
results from a given space charge field through the electrooptic (Pockels or
Kerr) effect. And finally, a fifth factor pertains to the physical optics
inherent in the readout process, whereby the diffraction efficiency and
polarization properties of the readout beam are derived directly from the
index ellipsoid modulation.

In addition, several other physical quantities factor into an evaluation of
the photorefractive sensitivity, including the wavelength of the recording
illumination (to convert the number of absorbed photons into an equivalent
energy), the absorption coefficients of the material at the wavelengths of
both the recording and readout beams (to correct for the fractional absorbance
of the recording beams and the fractional transmittance of the readout beam),
and the magnitude of the applied voltage (which significantly alters the
sensitivity characteristics for certain materials by changing the nature of
the dominant charge transport mechanism from the diffusion regime to the
drift regime).

In previous treatments of the photorefractive sensitivity and its associated
limits, all of the abovementioned factors have been addressed to some degree,
though not necessarily within a single unified treatment, or even within a
consistent set of constraints, assumptions, and approximations. Such a
unification is also beyond the scope of the present work. In this chapter, we
present the results of a study in which we have approached the photorefractive
sensitivity issue from a somewhat different perspective: that of the absolute
quantum efficiency of the photorefractive grating recording process. As such,
we attempt to answer an oft-stated but as yet unanswered question as to the
origin of the apparent insensitivity of photorefractive grating recording,
particularly in comparison with the relatively high sensitivities
characteristic of electrooptic spatial light modulators that utilize a similar
combination of photoconductive charge separation and electrooptic modulation
in the same single crystal materials (Tanguay, 1985).

In order to provide a quantitative metric that does in fact have a
fundamental physical limitation, we define herein the grating recording
efficiency of a photorefractive recording model or configuration as the
magnitude of the space charge field produced by a fixed number of
photogenerated mobile charge carriers at a given spatial frequency, normalized
by the maximum quantum limited space charge field that can be produced by the
same number of photogenerated carriers. This metric thus effectively combines
the notions of a photogeneration efficiency and a charge transport efficiency,
and it provides an estimate of the fundamental quantum efficiency of the
photorefractive grating recording process as determined by the particular
photoexcitation and charge transport mechanism invoked.
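
The definition just given can be stated compactly in symbols. The following is
only a notational sketch: the symbols $\eta_G$, $E_{sc}$, $E_{sc}^{\max}$,
$N_{ph}$, and $K$ are assumed here for illustration and are not notation
taken from the text.

```latex
% Grating recording efficiency (illustrative notation, not from the source):
%   K       grating spatial frequency (wave vector magnitude)
%   N_{ph}  fixed number of photogenerated mobile charge carriers
\eta_{G}(K) \;=\;
  \frac{\left| E_{sc}(K;\, N_{ph}) \right|}
       {E_{sc}^{\max}(K;\, N_{ph})},
\qquad 0 \le \eta_{G} \le 1 .
```

Written this way, the metric is dimensionless and bounded by unity, the bound
being reached only by the optimum quantum limited case.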

In order to fully utilize the concept of the grating recording efficiency,
we first calculate the maximum quantum limited space charge field that can be
produced by a fixed number of photogenerated mobile charge carriers, assuming
optimum photoexcitation and redistribution (transport) functions. We then
calculate the grating recording efficiencies that describe several important
limiting cases with selected idealized photoexcitation and redistribution
functions. This in turn allows for the definition of a baseline case against
which more realistic charge transport models can be compared. Examination of a
particular charge transport model as a test case then indicates several other
generically applicable factors that act to further decrease the grating
recording efficiency in various recording configurations. These results allow
the above analysis to be conveniently utilized for a wide range of
applications of current technological interest.

The organization of the remainder of this chapter is as follows. A brief
summary of the origin, material dependence, and implications of the several
parameters that affect the photorefractive grating recording sensitivity is
provided in the following section (Section 2). Alternative models of the
photorefractive recording process are defined in Section 3, and the
appropriate grating recording efficiencies are presented therein for each
case. Assumptions and limitations common to all of the models are discussed in
Section 3.1. Idealized photogeneration and charge transport models are defined
and analyzed in Section 3.2, and more realistic models based upon the analyses
of Young et al. (1974) and Moharam et al. (1979) in the initial stages of
recording before significant space charge field amplitudes evolve, and of
Kukhtarev (1976) for temporal evolution with small modulation depths of the
illumination profile, are discussed and analyzed in Section 3.3.
Representative grating recording efficiency calculations are presented in
Section 4 for several common materials and applications in order to illustrate
the effect of the quantum inefficiency factors on the overall quantum
efficiency of photorefractive recording. Finally, conclusions drawn from the
above analyses are discussed in Section 5.


2.  Factors Contributing to the Photorefractive Sensitivity

In this section, each of the five factors outlined in the introduction that
collectively determine the photorefractive sensitivity is briefly discussed,
in order to provide a suitable context for the derivation of the quantum
limited grating recording efficiency as presented in the following section. It
should perhaps be emphasized at the outset that although each of the primary
factors considered herein affects at least one of the aforementioned
alternative photorefractive sensitivity parameters, not all of the factors
enter into each defined parameter.

In the context of photorefractive grating recording, the photogeneration
quantum efficiency is related to the fraction of the incident photon flux that
generates mobile charge carriers free to participate in subsequent charge
transport and trapping processes. For a given recording wavelength, several
distinct photoexcitation processes can contribute to the total absorption
coefficient. In most commonly considered models of the photorefractive effect,
the dominant process is the photogeneration of free carriers from deep donor
or acceptor states, such that only one sign mobile carrier (either an electron
or a hole) is liberated for each photoevent. In wavelength regions of
significant photoconductivity, a second important process is the creation of
electron-hole pairs (as well as excitons in certain materials and material
structures) by means of band-to-band transitions. Examples of photoinduced
transitions that are not likely to contribute to the photorefractive effect,
and hence tend to reduce the photogeneration quantum efficiency, are
intersubband absorptions, intraionic level promotions, quantum well interlevel
excitations, and photochromic charge transfer exchanges. Since each
contributing process will in general be characterized by its own charge
transport efficiency (discussed below), it is most appropriate to assign
separate photogeneration quantum efficiencies not only to each charge carrier
type, but also to each distinct photoproduction origin (or photoexcitation
channel).

The inherent absorptive inefficiency implied by a finite thickness
photorefractive medium also affects the overall photogeneration quantum
efficiency. In the case of a thin crystal (such that $\alpha d \ll 1$, in
which $\alpha$ is the absorption coefficient at the recording wavelength and
$d$ is the crystal thickness), only a small fraction ($\approx \alpha d$) of
the recording beam intensity is absorbed and hence has an opportunity to
participate in the photorefractive process. In a thicker crystal, for which
the thickness may be optimized for maximum saturation diffraction efficiency,
the absorbed fraction is $[1 - e^{-\alpha d}]$ if only the entrance surface of
the photorefractive medium is allowed to achieve saturation. In this case the
recorded grating will exhibit an exponential nonuniformity throughout the
crystal thickness that will decrease the maximum achievable diffraction
efficiency. If the entire crystal is exposed to saturation, the
photogeneration quantum efficiency will be further reduced by a factor of
order $e^{\alpha d}$. Finally, the effects of reflection at both front and
rear crystal surfaces reduce the fraction of incident photons that contribute
to the formation of a given grating component and, hence, also reduce the
effective quantum efficiency. For a crystal thickness optimized for maximum
saturation diffraction efficiency with equal write and read wavelengths, and
for indices of refraction typical of common photorefractive materials, the
combined effect of absorption of the recording beams, absorption of the
readout beam, and reflection losses (assuming uncoated surfaces) on the
photorefractive sensitivity is a reduction of approximately an order of
magnitude.
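
The absorption and reflection factors described above are simple to evaluate
numerically. The following is a minimal sketch, assuming Beer-Lambert
absorption and normal-incidence Fresnel reflection at an uncoated surface in
air; the function names and the numerical values (absorption coefficient,
thicknesses, refractive index) are hypothetical and chosen only for
illustration.

```python
import math


def absorbed_fraction(alpha_per_cm: float, d_cm: float) -> float:
    """Fraction of the entering recording intensity absorbed in thickness d."""
    return 1.0 - math.exp(-alpha_per_cm * d_cm)


def fresnel_reflectance_normal(n: float) -> float:
    """Power reflectance of one uncoated surface in air at normal incidence."""
    return ((n - 1.0) / (n + 1.0)) ** 2


# Hypothetical illustrative values, not taken from the text.
alpha = 1.0                                     # absorption coefficient, cm^-1
thin = absorbed_fraction(alpha, 0.01)           # thin crystal: close to alpha*d
thick = absorbed_fraction(alpha, 1.0)           # thicker crystal: 1 - exp(-1)
reflectance = fresnel_reflectance_normal(2.5)   # assumed refractive index 2.5

print(f"thin-crystal absorbed fraction  : {thin:.5f}")
print(f"thick-crystal absorbed fraction : {thick:.5f}")
print(f"single-surface reflectance      : {reflectance:.4f}")
```

For the thin case the result is close to the product of the absorption
coefficient and the thickness, which is the approximation quoted in the text.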

An additional effect that acts to reduce the photogeneration quantum
efficiency is the constraint imposed indirectly by the nature of the
photoexcitation distribution. As we shall demonstrate in the next section, the
sinusoidal intensity interference pattern generated by two coherent recording
beams is not the optimum (quantum limited) photoexcitation distribution
function.

Perhaps the most critical factor that determines the photorefractive
sensitivity is the charge transport efficiency, in that this quantity, above
all others, exhibits the greatest degree of variation among commonly
investigated photorefractive materials. The charge transport efficiency
quantifies the degree to which the average photoproduced charge carrier
contributes to the forming space charge grating following photoexcitation,
charge transport, and subsequent recombination or trapping. Charge transport
in the refractory oxides, as well as in the compound semiconductor family, is
a statistical process in which the net result of a given photoinduced event
may be to increase, decrease, or leave unchanged the magnitude of the space
charge modulation at the fundamental grating frequency.

The statistical nature of the charge transport process can be subsumed in
the standard band transport treatment (Young et al., 1974; Moharam et al.,
1979; Kukhtarev et al., 1976, 1979), or made explicit as in the hopping
conduction model (Feinberg et al., 1980); both approaches lead to essentially
equivalent results (Feinberg et al., 1980; Jaura et al., 1986). The charge
transport efficiency is strongly affected in photorefractive materials by the
dominant conduction mechanism (diffusion and/or drift in an externally applied
field), and by the ratio of the average displacement of a photoexcited carrier
(before recombination or trapping) to the grating spacing. In both the drift
and diffusion regimes, the average displacement depends primarily on the
mobility-lifetime product of the photoexcited species. As such, separate
charge transport efficiencies should be assigned to each carrier type. A
significant net charge transport efficiency will be realized only if there is
a net differential in the carrier displacement and trapping process.
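
To make the role of the mobility-lifetime product concrete, the following
sketch evaluates the characteristic drift and diffusion lengths of a
photoexcited carrier. It assumes the textbook expressions (drift length equal
to the mobility-lifetime product times the applied field, and diffusion length
obtained from the Einstein relation); the function names and the numerical
values are hypothetical and are not taken from the text.

```python
import math

K_B = 1.380649e-23       # Boltzmann constant, J/K
Q_E = 1.602176634e-19    # elementary charge, C


def drift_length(mu_tau_m2_per_V: float, field_V_per_m: float) -> float:
    """Mean drift distance before trapping: L_E = (mu * tau) * E."""
    return mu_tau_m2_per_V * field_V_per_m


def diffusion_length(mu_tau_m2_per_V: float, temperature_K: float = 300.0) -> float:
    """Diffusion length L_D = sqrt(D * tau), with D = mu * k_B * T / e."""
    return math.sqrt(mu_tau_m2_per_V * K_B * temperature_K / Q_E)


# Hypothetical illustrative values.
mu_tau = 1.0e-11         # mobility-lifetime product, m^2/V
applied_field = 1.0e6    # applied field, V/m
grating_period = 1.0e-6  # grating spacing, m

L_E = drift_length(mu_tau, applied_field)
L_D = diffusion_length(mu_tau)

print(f"drift length / grating spacing     : {L_E / grating_period:.2f}")
print(f"diffusion length / grating spacing : {L_D / grating_period:.2f}")
```

Both lengths scale with the mobility-lifetime product, linearly for drift and
as its square root for diffusion, so the ratio of displacement to grating
spacing differs strongly between the two regimes for the same material.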

It should be noted that there are several situations that can yield vanishing
charge transport efficiencies, even with large photogeneration quantum
efficiencies. For example, in a single donor/single trap model, if the initial
Fermi level is more than $10\,k_B T$ or so above the un-ionized donor level,
no substantial charge rearrangement is possible due to the unavailability of
ionized donors (traps) outside the regions of significant photoexcitation.
Likewise, if the initial Fermi level is more than $10\,k_B T$ or so below the
un-ionized donor level, only band-to-band photoexcitations can contribute with
nonvanishing photogeneration quantum efficiencies, which again is likely to
yield negligible charge transport efficiency unless the mobilities and/or
lifetimes are significantly different, or unless operation in the drift regime
is engendered by employing an externally applied electric field.

The space charge grating that results from the combination of photoexcitation
and charge transport processes in turn gives rise to a modulation of the local
electric field at the same spatial frequency through the first Maxwell
equation:


$$ \nabla \cdot \left[ \epsilon \, \mathbf{E}(\mathbf{x}) \right] = \frac{\rho(\mathbf{x})}{\epsilon_0} , \qquad (3.1) $$

in which $\mathbf{E}$ is the total electric field at each point in space
$\mathbf{x}$, $\epsilon$ is the dielectric permittivity tensor, $\rho$ is the
local space charge amplitude, and $\epsilon_0$ is the dielectric permittivity
of free space. Note that the magnitude of the space charge field derived from
a given space charge grating amplitude is inversely proportional to the
grating wave vector, as implied by the differential relationship expressed in
Eq. (3.1). The tensor character of $\epsilon$ is important to note, as many
photorefractive materials (particularly the ferroelectric oxides) exhibit
marked dielectric anisotropy. Hence, space charge gratings oriented in
different directions within the same crystal can give rise to quite large
variations in the resultant space charge field. Note further that the
magnitude of the space charge field scales inversely with a diagonal component
of the dielectric permittivity for a given space charge grating oriented along
a principal dielectric axis of the crystal. Thus, materials with large
dielectric constants (such as BaTiO$_3$ and SBN) require correspondingly large
space charge amplitudes in order to produce an internal electric field
modulation of given amplitude.
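
The two scaling statements above (inverse proportionality to the grating wave
vector and to the dielectric constant) follow from integrating Gauss's law
once for a sinusoidal space charge grating oriented along a principal
dielectric axis. The sketch below assumes a charge density of the form
rho_1 cos(Kx) and a scalar relative permittivity along that axis; the function
name and all numerical values are hypothetical illustrations.

```python
import math

EPS0 = 8.8541878128e-12   # permittivity of free space, F/m
Q_E = 1.602176634e-19     # elementary charge, C


def space_charge_field_amplitude(rho1_C_per_m3: float,
                                 grating_period_m: float,
                                 eps_r: float) -> float:
    """Field amplitude for rho(x) = rho1*cos(K x): E1 = rho1 / (K*eps_r*eps0)."""
    K = 2.0 * math.pi / grating_period_m   # grating wave vector magnitude
    return rho1_C_per_m3 / (K * eps_r * EPS0)


# Hypothetical values: charge modulation of 1e21 m^-3 elementary charges,
# a 1 micrometer grating period, and an assumed relative permittivity of 56.
rho1 = 1.0e21 * Q_E
E1 = space_charge_field_amplitude(rho1, 1.0e-6, 56.0)
E1_high_eps = space_charge_field_amplitude(rho1, 1.0e-6, 560.0)
E1_fine_grating = space_charge_field_amplitude(rho1, 0.5e-6, 56.0)

print(f"field amplitude               : {E1:.3e} V/m")
print(f"ten times larger permittivity : {E1_high_eps:.3e} V/m")
print(f"half the grating period       : {E1_fine_grating:.3e} V/m")
```

A tenfold increase in the permittivity, or a doubling of the grating wave
vector, reduces the field produced by the same charge amplitude by the same
factor.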

In the types of photorefractive materials considered herein, the index of
refraction is a function of the local electric field. This dependence can
arise from a number of electrorefractive effects, including among others the
linear electrooptic (Pockels) effect, the quadratic electrooptic (Kerr)
effect, the Franz-Keldysh effect, and the quantum confined Stark effect
(Chemla et al., 1985). In some materials, notably multiple quantum well
structures in compound semiconductors, more than one such electrorefractive
effect can contribute simultaneously to the establishment of the resultant
index perturbation. For our purposes herein, we consider only the linear
electrooptic effect, in which the change in the dielectric impermeability
tensor $B$ (the inverse of the dielectric tensor $\epsilon$) is linear in the
electric field:

$$ \Delta B_{ij} = \Delta(\epsilon^{-1})_{ij} = r_{ijk} E_k , \qquad (3.2) $$

in which the Einstein summation rule is implied, and in which $r_{ijk}$ is the
third rank tensor representing the electrooptic coefficient (Kaminow, 1974).
As shown in Eq. (3.2), the tensor nature of the electrooptic effect implies a
dependence of the effective index of refraction on the orientation of the
grating within the crystal, as well as on the direction of propagation of the
readout beam and its polarization. Since in general one can derive an
effective electrooptic coefficient for a given experimental configuration,
Eq. (3.2) may be rewritten in the form

$$ \Delta n_{\mathrm{eff}}(x) = n_0^3 \, r_{\mathrm{eff}} \, E(x) , \qquad (3.3) $$


in which the $x$ coordinate is taken parallel to the grating wave vector,
$\Delta n_{\mathrm{eff}}(x)$ is the effective index modulation resulting from
the combination of the space charge field and the readout configuration, and
$n_0$ is the corresponding unperturbed index at the wavelength of the readout
beam.

In discussions of the photorefractive sensitivity, it is of considerable value
to combine the effects of the previous two factors, since it has been shown
that for a wide range of common photorefractive materials, the ratio
$n_0^3 r_{\mathrm{eff}} / \epsilon$ exhibits considerably reduced variation
compared with that of each parameter separately (Glass et al., 1984; Glass,
1984). This is indicative of the general observation that materials with large
static polarizabilities typically also exhibit concomitantly large
perturbations of the dielectric tensor at optical frequencies in response to
low frequency applied (or internal) electric fields.

Once the magnitude of the index perturbation is established, the resultant
diffraction efficiency can be directly determined from the thickness (and
uniformity) of the grating, the grating wave vector, the readout wavelength,
and the corresponding absorption coefficient. Provided that the grating
structure is sufficiently thick to assure diffraction in the Bragg regime and
that the phase and amplitude distortions associated with self-diffraction
effects can be neglected (Kukhtarev et al., 1979; Marrakchi et al., 1987), the
diffraction efficiency is given by (Kogelnik, 1969)


$$ \eta = \exp\left( -\frac{\alpha d}{\cos\theta_B} \right) \sin^2\left( \frac{\pi \, \Delta n \, d}{\lambda \cos\theta_B} \right) , \qquad (3.4) $$

in which $\theta_B$ is the Bragg angle, $\lambda$ is the readout wavelength,
and $\alpha$ is evaluated at the readout wavelength. The first term in this
expression denotes the inherent inefficiency associated with finite absorption
at the readout wavelength, while the second term derives from the
grating-modulation-induced diffraction process. Since for suitably small
values of the argument $\pi \Delta n \, d / \lambda \cos\theta_B$ the
diffraction efficiency scales as the square of both the index modulation and
the grating thickness, this photorefractive sensitivity factor is inherently
nonlinear and must be utilized with considerable caution.
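
The quadratic small-signal scaling noted above can be checked numerically. The
sketch below assumes the standard Kogelnik coupled-wave result for a lossy
transmission phase grating read out at the Bragg angle, with an intensity
absorption coefficient; the function name, argument names, and numerical
values are hypothetical and serve only as an illustration.

```python
import math


def diffraction_efficiency(delta_n: float, d: float, wavelength: float,
                           theta_b: float, alpha: float = 0.0) -> float:
    """Bragg-matched efficiency: exp(-alpha*d/cos) * sin^2(pi*dn*d/(lambda*cos))."""
    cos_b = math.cos(theta_b)
    nu = math.pi * delta_n * d / (wavelength * cos_b)   # grating strength
    return math.exp(-alpha * d / cos_b) * math.sin(nu) ** 2


# Hypothetical weak grating: index modulation 1e-6, thickness 1 mm,
# readout wavelength 633 nm, Bragg angle 0.1 rad, no absorption.
eta_1 = diffraction_efficiency(1.0e-6, 1.0e-3, 633e-9, 0.1)
eta_2 = diffraction_efficiency(2.0e-6, 1.0e-3, 633e-9, 0.1)

# With absorption (100 m^-1) the same grating diffracts less light.
eta_lossy = diffraction_efficiency(1.0e-6, 1.0e-3, 633e-9, 0.1, alpha=100.0)

print(f"eta (dn = 1e-6)            : {eta_1:.3e}")
print(f"eta (dn = 2e-6) / eta_1    : {eta_2 / eta_1:.4f}")
print(f"eta with absorption / eta_1: {eta_lossy / eta_1:.4f}")
```

Doubling the index modulation of a weak grating raises the efficiency by very
nearly a factor of four, which is why sensitivities defined through the
diffraction efficiency cannot be compared linearly across thicknesses or
exposure levels.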



R.V.  Johnson  and  A.R.  Tanguay,  Jr. 


Until this point in the discussion, we have assumed implicitly that the
grating recording exposures and spatial frequencies employed have been large
enough and small enough, respectively, to keep the photorefractive recording
process outside the additional limitations imposed by quantum statistical
fluctuations. For the recording of photorefractive gratings at very high
spatial frequencies and at very low grating recording exposures, several
additional factors will come into play, including statistical fluctuations in
the photogeneration process, corresponding fluctuations in the charge
transport and trapping processes that yield a locally inhomogeneous charge
distribution with spatial frequency components near that of the grating wave
vector, and concomitant variations in the direction and magnitude of the local
space charge field. These additional factors may act to further reduce the
overall photorefractive sensitivity.


3.  The Grating Recording Efficiency

As defined in the introduction (Section 1), the grating recording efficiency
of a given photorefractive recording model or configuration is the ratio
between the magnitude of the space charge field at a given spatial frequency
produced by a fixed number of photogenerated mobile charge carriers, and the
maximum quantum limited space charge field that can be produced by the same
number of photogenerated carriers. The grating recording efficiency is
introduced as a useful metric that effectively compares the photogeneration
and charge transport efficiencies of any given model with the optimum quantum
limited case, and as such provides an estimate of the overall fundamental
quantum efficiency of the photorefractive grating recording process. Note that
since the grating recording efficiency is normalized, any effect of the
dielectric permittivity tensor in establishing the magnitudes of the space
charge fields cancels out. Hence the grating recording efficiency may be
equivalently defined directly in terms of the space charge grating amplitudes
or in terms of the space charge fields for a dielectric constant of unity.

In this section, idealized photogeneration and charge transport models are
postulated and analyzed in order to determine the maximum quantum limited
space charge amplitude that can be produced by a fixed number of
photogenerated mobile charge carriers at a given spatial frequency. The
grating recording efficiencies of several such idealized models are then
calculated to form a set of baseline cases against which the corresponding
efficiencies of more realistic photorefractive recording models can be
directly compared. For one such model, that of a single mobile charge species
transported between a single type of donor site and its associated (ionized)
trap sites, several factors are identified that contribute to the grating
recording efficiency and that are particularly illustrative of the fundamental
limitations on the photorefractive sensitivity inherent in the model.

3.1.  Constraints Common to Alternative Models
of Photorefractive Recording

The idealized photogeneration and charge transport models defined herein are
abstractions of a more realistic model that has been studied by many authors
and has been analyzed in considerable detail by Kukhtarev et al. (1976, 1979),
and as such these models share key assumptions concerning quantum recording
limitations and implications of the readout process. For simplicity, we
confine our attention herein to a version of the model characterized by a
single mobile charge species, and a single type of donor site with associated
un-ionized donor and ionized donor (trap) states, although more intricate
models have been proposed for particular materials systems [e.g., the
existence of mobile holes as well as electrons in lithium niobate (Orlowski
and Krätzig, 1978) and barium titanate (Strohkendl et al., 1986), or the
existence of multiple trap levels in bismuth silicon oxide (Attard and Brown,
1986; Valley, 1986)].

Four principal material species are considered in the single mobile charge/
single trap level model, as diagrammed in Fig. 3.1. The mobile charge species
(usually electrons) has number density $n(x, t)$ and is represented by the
symbol $e^-$ in Fig. 3.1. It is assumed that negligible densities of these
mobile charges exist under dark conditions; essentially all mobile charges are
created by photoionization of donors. In this model, donors and traps are
assumed to be different valence states of the same impurity atom (e.g., iron
in lithium niobate) or lattice defects, as diagrammed in Fig. 3.2. The total
number of such impurity ions or defects is distributed uniformly throughout
the crystal at potential donor sites. A donor is converted into a trap
(ionized donor), simultaneously with the creation of a mobile charge, by
photogeneration; conversely, a mobile charge is removed from the conduction
band, and a trap is converted into a donor, by recombination (Fig. 3.2). The
sum of the number densities of donors and ionized donors is denoted by $N_D$,
which is therefore the total density of potential donor sites and is assumed
to be constant in space and time. If the number density of ionized donors is
taken to be $N_D^+(x, t)$, then the number density of (un-ionized) donors is
$[N_D - N_D^+(x, t)]$. In Fig. 3.1, the donors are represented by the symbol O
and the traps are represented by the symbol -. A fourth material species is
needed in this model to ensure charge conservation, as the density of ionized
donors $N_D^+(x, t)$ frequently exceeds the density of mobile charges
$n(x, t)$ by several orders of magnitude. This fourth species, with number
density $N_A$, has traditionally been called an acceptor, but is also called a
charge compensation site herein and is represented in Fig. 3.1 by the symbol
-. The charge compensation sites are electrically negative with respect to the
donors and are presumed to be negatively charged impurity ions or lattice
defects incorporated during the crystal growth process. The density $N_A$ is
assumed to be constant in space and time, and is further assumed to be
numerically equal to $N_D^+(x, t = 0)$, the concentration of ionized donors in
equilibrium. In point of fact, the only requirement for charge compensation is
that the product of the charge compensation site density and the effective
number of negative charges on each be equal to $N_D^+(x, t = 0)$.

Fig. 3.1. Schematic representation of the single mobile charge/single trap
level photorefractive recording model, indicating both the uniform
distributions and the relative densities of the four principal material
species involved prior to grating recording.

Fig. 3.2. Schematic representation of the single active species
photorefractive recording model, indicating the electron photogeneration (and
recombination) processes involved in the transformation of donors into traps
(and traps into donors).

It is instructive to consider the relative densities of these various species.
Consider, for example, a crystal of bismuth silicon oxide
(Bi$_{12}$SiO$_{20}$). Typical concentrations of donors, charge compensation
sites, and mobile charges under reasonable optical intensities are of order
$10^{19}$ cm$^{-3}$, $10^{16}$ cm$^{-3}$, and no more than $10^{12}$
cm$^{-3}$, respectively (Peltier and Micheron, 1977; see also Hou et al.,
1973). (The number density of electrons under dark conditions for typical
crystals of bismuth silicon oxide is entirely negligible.) Thus,
$N_D \gg N_D^+ \approx N_A \gg n$. A number of other photorefractive media
exhibit similar proportionalities. For these materials, the total space charge
density $\rho(x, t)$ is given by

$$ \rho(x, t) = e \left[ N_D^+(x, t) - n(x, t) - N_A \right] \approx e \left[ N_D^+(x, t) - N_A \right] . \qquad (3.5) $$
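
The size of the approximation made in Eq. (3.5) can be gauged with a short
calculation. The sketch below evaluates the space charge density with and
without the mobile charge term; the acceptor and mobile charge densities are
the order-of-magnitude figures quoted above for bismuth silicon oxide, while
the local excess of ionized donors (5 percent above the acceptor density) and
the function name are hypothetical choices made only for illustration.

```python
Q_E = 1.602176634e-19   # elementary charge, C


def space_charge_density(n_d_plus: float, n_mobile: float, n_a: float) -> float:
    """Space charge density e*(N_D+ - n - N_A); densities in cm^-3, result in C/cm^3."""
    return Q_E * (n_d_plus - n_mobile - n_a)


N_A = 1.0e16           # charge compensation sites, cm^-3 (order of magnitude)
n_mobile = 1.0e12      # mobile charges under illumination, cm^-3 (upper bound)
N_D_plus = 1.05e16     # hypothetical local ionized donor density, cm^-3

rho_full = space_charge_density(N_D_plus, n_mobile, N_A)
rho_approx = space_charge_density(N_D_plus, 0.0, N_A)
relative_error = abs(rho_approx - rho_full) / abs(rho_full)

print(f"rho including mobile charge : {rho_full:.4e} C/cm^3")
print(f"rho neglecting mobile charge: {rho_approx:.4e} C/cm^3")
print(f"relative error              : {relative_error:.2e}")
```

Under these assumed values the mobile charge changes the space charge density
by well under one percent, which is the sense in which it is dropped.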

The local density of ionized donors $N_D^+$ can be spatially redistributed
under the influence of inhomogeneous photogeneration, as shown by comparing
Figs. 3.1 and 3.3. This redistribution occurs as follows. A photon is absorbed
by a donor, converting the donor into a trap (ionized donor) and generating a
mobile charge carrier. The mobile charge carrier is transported some distance
through the photorefractive medium due to drift and/or diffusion, and it is
subsequently captured by a trap, thus generating another donor. If we choose
to follow a particular electron, the entire process can be viewed as a simple
exchange between equivalent donor sites of a donor state with an ionized donor
(or trap) state.
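
The exchange picture described above can be illustrated with a deliberately
simple toy calculation. It is not one of the models analyzed in this chapter:
it assumes a sinusoidal photogeneration profile and assumes that every carrier
is displaced by the same fixed distance before being retrapped. The function
name, the modulation depth, and the sampling are all hypothetical. The
quantity computed is the fundamental Fourier amplitude of the net change in
ionized donor density.

```python
import cmath
import math


def fundamental_amplitude(shift_over_period: float, m: float = 1.0,
                          samples: int = 1024) -> float:
    """Fundamental harmonic of (traps created - traps removed) over one period.

    Traps are created where carriers are generated, profile 1 + m*cos(2*pi*u),
    and removed where the displaced carriers are recaptured (same profile,
    shifted by `shift_over_period` grating periods).
    """
    total = 0j
    for i in range(samples):
        u = i / samples                                   # position / period
        created = 1.0 + m * math.cos(2.0 * math.pi * u)
        removed = 1.0 + m * math.cos(2.0 * math.pi * (u - shift_over_period))
        total += (created - removed) * cmath.exp(-2j * math.pi * u)
    return 2.0 * abs(total) / samples


half_period = fundamental_amplitude(0.5)      # carriers move half a period
full_period = fundamental_amplitude(1.0)      # carriers move a full period
quarter_period = fundamental_amplitude(0.25)  # carriers move a quarter period

print(f"shift of 1/2 period: {half_period:.4f}")
print(f"shift of 1 period  : {full_period:.4f}")
print(f"shift of 1/4 period: {quarter_period:.4f}")
```

In this toy picture a displacement of half a grating period maximizes the
fundamental component, while a displacement of a whole period leaves no net
grating at all, even though the same number of carriers has been generated and
retrapped.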

As a final note, the space charge profile $\rho(x, t)$ might evolve into a
highly distorted profile with respect to the distribution of incident
illumination because of nonlinearities inherent in the recording process.
However, since the grating readout is typically performed deep within the
Bragg regime, at most a very limited range of spatial frequencies is effective
in diffracting the readout light. Thus only one spatial harmonic of the space
charge field is of interest, which is assumed herein to be the fundamental
harmonic.

Fig. 3.3. Schematic representation of the single mobile charge/single trap
level photorefractive recording model, indicating the spatial redistribution
of the four principal material species following grating recording; note that
the sum of the donor and trap densities is space-invariant. The symbols
identifying each species are given in Fig. 3.1.

In summary, the photorefractive recording model studied by Kukhtarev and all
idealized photogeneration and charge transport abstractions considered in the
following analysis share these basic assumptions: a) only a finite amount of
space charge [$N_D^+(x, t = 0) = N_A$] exists for generating the space charge
electric field, b) this space charge can be spatially redistributed under the
influence of an illumination pattern, c) no more than one mobile charge is
generated for each absorbed photon of the illumination beam, and d) only the
fundamental spatial harmonic of the space charge is effective in the
holographic readout process. The photorefractive recording model and the
idealized models differ, however, in the details of the photogeneration and
charge transport processes, as discussed next.

3.2.  Idealized  Photogeneration  and  Charge  Transport  Models 
The grating recording efficiency comprises three successive physical
processes: photogeneration, charge transport, and trapping. In order to
determine the maximum quantum limited space charge field (at unity dielectric
constant) that can result from a fixed number of photogenerated mobile
charges, we investigate four idealized photogeneration and charge transport
models, as defined below.

The photogeneration process is controlled by the recording illumination
profile. Two alternative profiles are considered herein: a periodic comb
function and a sinusoidal function. The comb illumination profile $I_c(x)$,
as shown in Fig. 3.4, is defined by

$$I_c(x) \Rightarrow I_{c0}\,\mathrm{comb}\!\left(\frac{x}{\Lambda_G}\right), \qquad (3.6)$$


in which the arrow $\Rightarrow$ indicates that the desired function
asymptotically approaches a sequence of Dirac delta functions, i.e., a series
of intensity peaks with spatial extent small compared with the spatial period
$\Lambda_G$. Unlike a mathematical delta function, however, the intended comb
peaks are assumed to be large enough to overlap a reasonable number of donor
sites. Note that the periodic comb function does not correspond to any normal
recording configuration. This is acceptable for purposes of this analysis
because the maximum quantum limited space charge field at unity dielectric
constant is intended to define an upper limit against which more realistic
models might be compared.

Fig. 3.4. Schematic representation of the two principal illumination profiles
(comb and sinusoid functions) utilized in the idealized photogeneration and
charge transport models. The comb function is illustrated with finite width
to incorporate a given number of donors at a predetermined donor density.

The sinusoid illumination profile $I_s(x)$ of unity modulation depth is
defined by

$$I_s(x) = I_{s0}\left[1 + \cos\!\left(\frac{2\pi x}{\Lambda_G}\right)\right], \qquad (3.7)$$


in which $\Lambda_G$ is the spatial period of the illumination profile. To
compare these two illumination profiles, the same photon flux is assumed;
that is, the scaling parameters $I_{s0}$ and $I_{c0}$ are adjusted such that
the following normalization integral over one spatial period is satisfied:

$$\int_{-\Lambda_G/2}^{\Lambda_G/2} I_s(x)\,dx = \int_{-\Lambda_G/2}^{\Lambda_G/2} I_c(x)\,dx, \qquad (3.8)$$

from which we derive the relation that $I_{s0} = I_{c0} \equiv I_0$. In the
idealized photogeneration models, all photons in the illumination profile are
assumed to be absorbed and to generate mobile charge carriers.
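
The normalization can be checked in a few lines. The sketch below assumes the comb function is the usual sum of unit-area Dirac delta functions in its dimensionless argument; the source does not spell out that definition, so it is an assumption made here for illustration.

```latex
% Assumed definition of the comb function (not stated explicitly in the text):
\mathrm{comb}\!\left(\frac{x}{\Lambda_G}\right)
  = \sum_{n} \delta\!\left(\frac{x}{\Lambda_G} - n\right)
  = \Lambda_G \sum_{n} \delta(x - n\Lambda_G)

% Photon flux per period for the comb profile (one peak per period):
\int_{-\Lambda_G/2}^{\Lambda_G/2} I_{c0}\,\mathrm{comb}\!\left(\frac{x}{\Lambda_G}\right) dx
  = I_{c0}\,\Lambda_G

% Photon flux per period for the sinusoid (the cosine term integrates to zero):
\int_{-\Lambda_G/2}^{\Lambda_G/2} I_{s0}\left[1 + \cos\!\left(\frac{2\pi x}{\Lambda_G}\right)\right] dx
  = I_{s0}\,\Lambda_G

% Equating the two gives I_{s0} = I_{c0} \equiv I_0.
```

Under this definition both profiles deliver the same number of photons per grating period when their scaling parameters are equal, which is the relation quoted after Eq. (3.8).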

Two alternative models of charge transport and trapping are also considered:
half wavelength translation and randomization (Figs. 3.5 and 3.6). In the
half wavelength translation process shown schematically in Fig. 3.5, each
electron is assumed to translate precisely half the grating period of the
illumination profile before capture, without diffusive blooming of the
electron cloud. In the analysis that follows, it will be shown that half
wavelength translation corresponds to the quantum limited charge transport
process, giving rise to the maximum grating recording efficiency. The second
idealized transport and capture process to be considered herein is complete
randomization, as shown schematically in Fig. 3.6, in which the
photogenerated electrons are assumed to redistribute randomly until they
exhibit no spatial variation in density, and only then are recaptured by
local traps. The randomization model is intended to represent a more
realistic charge transport model; even so, this model corresponds to an upper
bound on realizable charge transport efficiencies, as shown in the next
section.

Schematic representation of the two principal charge redistribution
mechanisms (half wavelength translation and randomization) utilized in the
idealized photogeneration and charge transport models, for the case of comb
illumination. A single mobile charge species (electrons) is assumed for
purposes of illustration.

Schematic representation of the two principal charge redistribution
mechanisms (half wavelength translation and randomization) utilized in the
idealized photogeneration and charge transport models, for the case of
sinusoidal illumination.

Combination of the two photogeneration models and the two charge transport
and trapping models produces four alternative idealized recording models for
comparison (Fig. 3.7), which will hereinafter be identified as the bipolar
comb, the monopolar comb, the transport-efficient sinusoid, and the baseline
sinusoid. Each of the four combinations will be considered in turn. Grating
recording efficiencies are calculated for these four combinations in Section
3.2.1, and the corresponding space-charge-field saturation behavior is
considered in Section 3.2.2.

Matrix representation of the four idealized photogeneration and charge
transport models, as derived from the principal illumination profiles and
charge transport mechanisms.


3.2.1.  Idealized  Grating  Recording  Efficiencies

The bipolar comb results from a periodic comb illumination function combined
with a half wavelength translation. The resulting space charge density
$\rho(x)$ induced by this illumination and transport combination is a
periodic comb function superimposed on a periodic comb function of opposite
sign shifted by half of a grating wavelength, as shown in Fig. 3.8. For a
given peak photogenerated charge density $\rho_0$ (proportional to $I_0 t$,
in which $t$ is the exposure time), the resultant space charge distribution
$\rho(x)$ is given by

$$\rho(x) = \rho_0\left[\mathrm{comb}\!\left(\frac{x}{\Lambda_G}\right) - \mathrm{comb}\!\left(\frac{x}{\Lambda_G} - \frac{1}{2}\right)\right]. \qquad (3.9)$$

This space charge profile $\rho(x)$ can be readily integrated [see Eq. (3.1)]
to yield a space charge field $E(x)$ that is a square pulse train (Fig. 3.8).
The first spatial harmonic component, $E_1$, defined by

[defining expression not legible in the source] (3.10)

has a magnitude of

$$E_1 = 4e\,\frac{\rho_0}{\epsilon_0 K_G}, \qquad (3.11)$$

in which $K_G = 2\pi/\Lambda_G$ is the wave vector of the illumination
profile.

Space charge density and field profiles for the bipolar comb distribution.
The first spatial harmonic of the space charge field is represented by the
dashed curve and is scaled to the magnitude of the total space charge field.
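
As an illustrative check on Eq. (3.11), the sketch below projects a unit square wave onto its fundamental harmonic. It assumes ideal delta-function charge sheets (so the field is a perfect square wave) and works in units of $e\rho_0/(\epsilon_0 K_G)$, in which the square wave amplitude is $\pi$; the function names and sampling choices are hypothetical and are not taken from the source.

```python
import math

def square_wave_field(x, period=1.0, amplitude=1.0):
    """Field from alternating +/- charge sheets spaced half a period apart.

    Assumes ideal (delta-function) sheets, so the field is +amplitude on the
    first half of each period and -amplitude on the second half.
    """
    phase = (x / period) % 1.0
    return amplitude if phase < 0.5 else -amplitude

def first_harmonic_amplitude(field, period=1.0, samples=20000):
    """Fourier sine coefficient of the fundamental, by the midpoint rule."""
    dx = period / samples
    total = 0.0
    for i in range(samples):
        x = (i + 0.5) * dx
        total += field(x) * math.sin(2.0 * math.pi * x / period) * dx
    return 2.0 * total / period

# Ratio of first-harmonic amplitude to square-wave amplitude (expected 4/pi).
ratio = first_harmonic_amplitude(square_wave_field)

# In units of e*rho_0/(eps_0*K_G) the square wave has amplitude pi, so the
# first harmonic should come out close to 4, as in Eq. (3.11).
e1_units = first_harmonic_amplitude(
    lambda x: square_wave_field(x, amplitude=math.pi))

print(f"first harmonic / square-wave amplitude = {ratio:.6f}")
print(f"E_1 in units of e*rho_0/(eps_0*K_G)    = {e1_units:.6f}")
```

The ratio exceeds unity, which is the numerical counterpart of the remark that the first harmonic of a square wave is larger than the square wave itself.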


The monopolar comb commands interest because it can support the highest space
charge field before saturation due to limited ionized trap density, as
discussed in the next section. The monopolar comb results from a periodic
comb illumination function combined with electron randomization. The
resulting space charge profile $\rho(x)$ induced by this illumination and
transport combination is a periodic comb function superimposed on a uniform
background of opposite charge, as shown in Fig. 3.9. The resulting charge
distribution is represented by

$$\rho(x) = \rho_0\left[\mathrm{comb}\!\left(\frac{x}{\Lambda_G}\right) - 1\right]. \qquad (3.12)$$

The corresponding space charge field exhibits a sawtooth profile (Fig. 3.9),
and the magnitude $E_1$ of its first spatial harmonic is

$$E_1 = 2e\,\frac{\rho_0}{\epsilon_0 K_G}. \qquad (3.13)$$

The transport-efficient sinusoid results from a sinusoidal illumination
profile combined with a half wavelength translation, as shown schematically
in Fig. 3.10, leading to a charge density $\rho(x)$ given by

$$\rho(x) = 2\rho_0 \cos\!\left(\frac{2\pi x}{\Lambda_G}\right), \qquad (3.14)$$

and a first spatial harmonic field component $E_1$ of

$$E_1 = 2e\,\frac{\rho_0}{\epsilon_0 K_G}. \qquad (3.15)$$

The transport-efficient sinusoid combination is included for completeness,
but is not emphasized because it is not realistic, nor does it correspond to
any upper bound of grating recording efficiency or saturation performance.

The baseline sinusoid, although seemingly artificial, is of very pronounced
interest because it represents an asymptotic upper bound on the grating
recording efficiency predicted by more realistic recording models, as
discussed in Section 3.3 below. The baseline sinusoid results from a
sinusoidal illumination profile combined with randomization of the electron
distribution. The resulting space charge distribution exhibits a sinusoidal
profile, as does the corresponding space charge field, as shown in Fig. 3.10.
The charge density is given by

$$\rho(x) = \rho_0 \cos\!\left(\frac{2\pi x}{\Lambda_G}\right), \qquad (3.16)$$

corresponding to a first spatial harmonic field component $E_1$ of

$$E_1 = e\,\frac{\rho_0}{\epsilon_0 K_G}. \qquad (3.17)$$

Space charge density and field profiles for the monopolar comb distribution.
The first spatial harmonic of the space charge field is represented by the
dashed curve and is scaled to the magnitude of the total space charge field.

Space charge density and field profiles for the transport-efficient (first
scale values) and baseline sinusoid (second scale values) distributions. The
space charge fields are scaled to the magnitude of the transport-efficient
case.

As can be seen from the analysis above, the bipolar comb
photogeneration/charge transport combination generates the maximum quantum
limited space charge field at unity dielectric constant, and hence provides
the normalization constant required for evaluation of the grating recording
efficiencies of both idealized and realistic photorefractive recording
models. Physically, this optimum combination of photogeneration and charge
transport results from the maximum possible average separation of the
positive and negative charge distributions, as well as from the fact that the
amplitude of the first harmonic of a square wave exceeds the amplitude of the
square wave itself.

The grating recording efficiencies for the four idealized photorefractive
grating recording combinations are shown in Table 3.1. The most
quantum-efficient recording occurs for the bipolar comb, which therefore is
assigned a grating recording efficiency of unity. The next most efficient
configurations are the monopolar comb and the transport-efficient sinusoid,
which both yield a grating recording efficiency of 0.5. The least efficient
configuration is the baseline sinusoid, with a grating recording efficiency
of 0.25. As we shall show later, the efficiency of the baseline sinusoid is
an asymptotic upper limit of more realistic recording models. Table 3.2 gives
the relative photorefractive grating recording sensitivities for these four
cases in terms of diffraction efficiency per unit incident photon flux,
assuming low diffraction efficiencies such that the efficiency is
proportional to the square of the space charge field component $E_1$. Note
that by this measure the sensitivity of the baseline sinusoid is degraded by
a factor of 16, over an order of magnitude, compared with the quantum limit
represented by the bipolar comb charge distribution function.
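
The four efficiencies quoted above can be reproduced numerically. The following is a minimal sketch, assuming a unit grating period, a narrow rectangular peak of unit area standing in for each comb tooth, and fields expressed in units of $e\rho_0/(\epsilon_0 K_G)$; all function names and the sampling parameters are hypothetical choices made for illustration.

```python
import math

N = 4000                       # samples per grating period (period set to 1)
DX = 1.0 / N
XS = [(i + 0.5) * DX for i in range(N)]

def comb_profile(width=0.002):
    """Narrow rectangular peak of unit area per period, centred on x = 0."""
    return [1.0 / width if min(x, 1.0 - x) < width / 2 else 0.0 for x in XS]

def sinusoid_profile():
    """Unity-modulation-depth sinusoid, 1 + cos(2*pi*x)."""
    return [1.0 + math.cos(2.0 * math.pi * x) for x in XS]

def half_period_shift(profile):
    """Half wavelength translation: every electron moves half a period."""
    return profile[N // 2:] + profile[:N // 2]

def randomize(profile):
    """Complete randomization: electrons are retrapped uniformly."""
    mean = sum(profile) / N
    return [mean] * N

def first_harmonic(charge):
    """Amplitude of the fundamental Fourier component of a periodic profile."""
    a = 2.0 * sum(c * math.cos(2.0 * math.pi * x) for c, x in zip(charge, XS)) * DX
    b = 2.0 * sum(c * math.sin(2.0 * math.pi * x) for c, x in zip(charge, XS)) * DX
    return math.hypot(a, b)

def field_harmonic(generated, transport):
    """E_1 in units of e*rho_0/(eps_0*K_G): ionized donors minus retrapped electrons."""
    trapped = transport(generated)
    net = [g - t for g, t in zip(generated, trapped)]
    return first_harmonic(net)

fields = {
    "bipolar comb": field_harmonic(comb_profile(), half_period_shift),
    "monopolar comb": field_harmonic(comb_profile(), randomize),
    "transport-efficient sinusoid": field_harmonic(sinusoid_profile(), half_period_shift),
    "baseline sinusoid": field_harmonic(sinusoid_profile(), randomize),
}

reference = fields["bipolar comb"]
for name, value in fields.items():
    efficiency = value / reference
    print(f"{name:30s} E_1 = {value:.3f}  efficiency = {efficiency:.3f}  "
          f"sensitivity = {efficiency ** 2:.4f}")
```

Because the comb teeth have finite width in this sketch, the comb values fall very slightly below the ideal 4 and 2; the efficiency ratios and the factor of 16 in sensitivity are recovered to well within the plotted precision.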

3.2.2.  Space  Charge  Saturation  for  the  Idealized  Models 

Not only is the photorefractive recording sensitivity of concern, but also
saturation limitations of the space charge field occurring because of limited
ionized trap density. Consider, for example, the bipolar comb shown in Fig.
3.8. The regions of positive space charge grow by the photoexcitation of
neutral donors, which converts them into positively ionized donors (traps)
and generates mobile electrons. The electrons are removed from this region by
the various transport processes, leaving behind the positively ionized
donors. As will be shown later, it is these regions of net positive space
charge that primarily contribute to the buildup of the space charge field.
The regions of negative space charge grow by the reverse process, i.e., by
capturing mobile electrons at local ionized donor sites to form neutral
donors. In pho- [...] of ionized donors available to capture these electrons.
Conversely, the monopolar comb combination exhibits the slowest saturation at
the highest space charge field because the electrons are uniformly
distributed throughout the volume of the photorefractive medium and, hence,
can be captured by all of the ionized donors. The saturation characteristics
of the sinusoidal combinations are intermediate between the bipolar and the
monopolar combinations.

The relative photorefractive grating recording sensitivities (grating
recording efficiencies) and saturation characteristics of these four
photogeneration/charge transport combinations are schematically diagrammed in
Fig. 3.12. The photosensitivities are indicated by the initial linear slopes
and the saturation by the final space charge field levels. The space charge
field is plotted here in units of $E_q$, which is defined as
$eN_A/(\epsilon\epsilon_0 K_G)$ (Amodei, 1971). (This expression is valid
when the density of charge compensation sites $N_A$ is much smaller than the
total density of potential donor sites $N_D$.) Note that the saturation level
for the bipolar comb should really be much closer to the horizontal axis; it
has been overstated for clarity of illustration.
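
To give a feel for the magnitude of the normalizing field $E_q = eN_A/(\epsilon\epsilon_0 K_G)$, the sketch below evaluates it for one set of sample values. The compensation site density of $10^{16}$ cm$^{-3}$ follows the order of magnitude quoted earlier; the relative dielectric constant of 56 and the 1 micrometer grating period are assumptions introduced here purely for illustration, and the function name is hypothetical.

```python
import math

E_CHARGE = 1.602176634e-19     # elementary charge, C
EPS0 = 8.8541878128e-12        # vacuum permittivity, F/m

def saturation_field(n_a_per_m3, rel_permittivity, grating_period_m):
    """E_q = e*N_A / (eps*eps_0*K_G), with K_G = 2*pi / grating period (SI units)."""
    k_g = 2.0 * math.pi / grating_period_m
    return E_CHARGE * n_a_per_m3 / (rel_permittivity * EPS0 * k_g)

# Assumed sample values: N_A = 1e16 cm^-3 = 1e22 m^-3, eps = 56, period = 1 um.
e_q = saturation_field(n_a_per_m3=1e22, rel_permittivity=56.0,
                       grating_period_m=1e-6)
print(f"E_q = {e_q:.3e} V/m = {e_q / 1e5:.2f} kV/cm")
```

Since $E_q$ scales inversely with $K_G$, doubling the grating period in this sketch doubles the field available before trap-limited saturation.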

Having calculated the grating recording efficiencies and saturation fields
for these four highly idealized cases, we now proceed to compare these ideal
results with more realistic photogeneration and charge transport models in
the next two sections.

Fig. 3.12. Space charge field as a function of grating recording time showing
the initial linear growth (photorefractive sensitivity) and subsequent
saturation regimes.


3.3.  Realistic  Recording  Models 

The idealized photorefractive recording models discussed above allow the
effects of the photogeneration profile and charge transport process on the
grating recording efficiency to be assessed relative to the fundamental
quantum limits. We now examine a more realistic model applicable to a wide
range of commonly investigated photorefractive media, which exhibits an
overall efficiency degraded from that of the baseline sinusoid case presented
above by several additional factors. In this model, the photogeneration
profile is assumed to be sinusoidal, and the effects of the photogeneration
quantum efficiency, absorption coefficients, and reflection losses are
assumed to be space-invariant efficiency factors and hence can be directly
incorporated in any estimate of the photorefractive grating recording
sensitivity. Another major factor is an inherent inefficiency in the charge
transport process; this inefficiency is studied in Section 3.3.1 using
analytical solutions derived by Young et al. (1974; see also Moharam et al.,
1979), which are valid in the initial recording interval before significant
space charge fields have evolved. A related factor derives from the reduced
recording sensitivity exhibited as the space charge field approaches its
steady state limit. This is studied in Section 3.3.2 using analytical
solutions derived by Kukhtarev (1976) that describe the temporal evolution of
the space charge field in the limit of very low illumination profile
modulation depths. Recording configurations that generate low modulation
depths are inherently inefficient, as most of the photons in the illumination
contribute a uniform background photocurrent, and only a fraction of the
incident intensity contributes to the spatial structure of the image. The
modulation depth, therefore, is also a factor that reduces the
photorefractive grating recording sensitivity. However, low modulation depths
are necessary in certain recording techniques for enhancing the space charge
field in the steady state limit, such as the running grating process
discussed in Section 3.3.3.

3.3.1.  Initial  Recording  Sensitivity 

To determine the existence of degraded charge transport efficiency, the
idealized recording models discussed in Section 3.2 must be compared with
more realistic charge transport solutions, such as those given by Young et
al. (1974; see also Moharam et al., 1979). Under normal recording conditions,
the coupled photorefractive recording equations are nonlinear, making
analytical solutions difficult or impossible to derive. However, a
significantly simplified analysis can be utilized during the initial
recording period, which enables analytical solutions to be derived at least
for certain illumination profiles. These analytical solutions are well worth
studying for the physical insight they furnish into the charge transport
process.

The analytical simplification described above derives from a linearization of
the recording equations, in which two recording parameters, the total
electric field and the ionized donor (trap) density, remain essentially
constant throughout space during the initial recording interval (Young et
al., 1974; Moharam et al., 1979). This assumes that the photorefractive
crystal is initially in quasi-thermodynamic equilibrium, i.e., with a
spatially uniform distribution of ionized donors $N_D^+(x, t = 0) = N_A$, and
that the trap density remains essentially constant throughout this initial
recording interval.

The analytical solutions that exist in this regime have typically emphasized
recording with sinusoidal illumination profiles. While such a profile
corresponds closely with typical experimental situations and simplifies the
mathematics, the physics of the transport process is somewhat obscured in
comparison with an alternative illumination profile, that of a very narrow
slit, which can be viewed as an approximation of a Dirac delta function.
Typical trapped electron density profiles obtained in response to a narrow
slit illumination profile are shown in Fig. 3.13 for the cases of
diffusion-only transport (top illustration), drift-only transport (middle
illustration), and one particular combination of drift and diffusion
processes (bottom illustration); the derivation of these figures is described
below. These figures emphasize several important features of a more realistic
transport analysis. The transport mechanism is inherently a random process,
with a spread in characteristic transport lengths associated with a
corresponding spread in charge carrier lifetimes. Useful parameters for
characterizing the transport processes are the average transport lengths
$L_E$ for drift-induced transport and $L_D$ for diffusive transport, defined
as (Young et al., 1974; Moharam et al., 1979)

$$L_E = \mu\tau E_0 \qquad (3.18)$$

and

$$L_D = (D\tau)^{1/2} = \left(\frac{\mu\tau k_B T}{e}\right)^{1/2}, \qquad (3.19)$$

in which $\mu$ is the mobility of the charge carriers, $E_0$ is the applied
bias electric field, $\tau$ is the charge carrier lifetime, $D$ is the
diffusion coefficient for the charge carriers, $k_B$ is Boltzmann's constant,
$T$ is the crystal temperature, and $e$ is the charge of an electron.
Einstein's relation between the diffusion coefficient and the mobility has
been used in Eq. (3.19). Note that in both cases the transport lengths are
functions of the $\mu\tau$ product.

Typical trapped electron density profiles obtained in response to a narrow
slit illumination profile. The average transport lengths $L_D$ for diffusive
transport and $L_E$ for drift-induced transport are defined in the text.
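
As a rough numerical illustration of the two transport lengths, the sketch below evaluates the drift length $\mu\tau E_0$ and the diffusion length $(\mu\tau k_B T/e)^{1/2}$, the latter following from Einstein's relation $D = \mu k_B T/e$. The mobility-lifetime product of $10^{-11}$ m$^2$/V, the 6 kV/cm bias field, and the 300 K temperature are assumed sample values, not figures taken from the text, and the function names are hypothetical.

```python
import math

K_B = 1.380649e-23             # Boltzmann constant, J/K
E_CHARGE = 1.602176634e-19     # elementary charge, C

def drift_length(mu_tau, bias_field):
    """Average drift transport length, L_E = mu*tau*E_0 (SI units)."""
    return mu_tau * bias_field

def diffusion_length(mu_tau, temperature=300.0):
    """Average diffusion transport length, L_D = sqrt(mu*tau*k_B*T/e)."""
    return math.sqrt(mu_tau * K_B * temperature / E_CHARGE)

# Assumed sample values (for illustration only).
mu_tau = 1e-11                 # mobility-lifetime product, m^2/V
e_0 = 6e5                      # applied bias field, V/m (6 kV/cm)

l_e = drift_length(mu_tau, e_0)
l_d = diffusion_length(mu_tau)
print(f"L_E = {l_e * 1e6:.2f} um, L_D = {l_d * 1e6:.2f} um")
```

Both lengths depend on the material only through the mobility-lifetime product, so a single measured value of that product fixes the drift length at any bias field and the diffusion length at any temperature.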

For the diffusion-only case, the electron spread is symmetrical, which
introduces no net phase shift in the photorefractive response to any
arbitrary illumination profile. For the drift case, as well as for the
combined drift/diffusion case, the electron distribution is skewed to one
side by the presence of an applied bias field, which does in fact introduce a
phase shift when recording particular illumination profiles, as shown in
Fig. 3.13.

Now consider an illumination profile $I(x)$ that is sinusoidal and of the
form

$$I(x) = I_0\,[1 + m \cos(K_G x)], \qquad (3.20)$$

in which $m$ is the modulation depth of the light profile and $K_G$ is the
wave vector associated with the interference pattern. For such an
illumination profile, Young et al. (1974; see also Moharam et al., 1979) have
derived expressions for the growth of the space charge field during the
initial recording interval. When transport is dominated by diffusion, the
initial growth of the first harmonic component $E_1$ of the space charge
field is expressed by

$$E_1 = m\left[\frac{t e g_0}{\epsilon\epsilon_0 K_G}\right]\left[\frac{K_G^2 L_D^2}{1 + K_G^2 L_D^2}\right], \qquad (3.21)$$

in which $g_0$ is the photogeneration rate and $t$ is the time relative to
the initiation of grating recording. The initial growth when drift transport
dominates is expressed by

$$E_1 = m\left[\frac{t e g_0}{\epsilon\epsilon_0 K_G}\right]\left[\frac{K_G L_E}{(1 + K_G^2 L_E^2)^{1/2}}\right], \qquad (3.22)$$

with an associated phase shift $\phi$ defined by

$$\tan\phi = K_G L_E. \qquad (3.23)$$


The first bracketed term $[t e g_0/(\epsilon\epsilon_0 K_G)]$ in Eqs. (3.21)
and (3.22) corresponds to the grating recording efficiency predicted by the
baseline sinusoid model (with $\rho_0 = t g_0$), as discussed in Section 3.2.
This represents the upper bound on achievable recording sensitivity. The
second bracketed terms in Eqs. (3.21) and (3.22) correspond to an additional
charge transport inefficiency inherent in more realistic models of
photorefractive recording, the subject of this section. This transport
inefficiency factor is plotted as a function of increasing transport length
in Fig. 3.14 for diffusion-only transport and in Fig. 3.15 for drift-only
transport. Recall that these curves apply only during the initial recording
interval, before significant space charge has accrued. Later recording will
be characterized by a lower transport efficiency because of the presence of
the space charge field. Note in Figs. 3.14 and 3.15 that the charge transport
efficiency asymptotically approaches its maximum value in the limit of very
long transport lengths, as intuitively expected, although even in this limit
the maximum recording sensitivity is that of the baseline sinusoid, not that
of the transport-efficient sinusoid.
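
The behavior of the transport factors can be tabulated directly. The sketch below assumes the second bracketed factors take the forms $K_G^2 L_D^2/(1 + K_G^2 L_D^2)$ for diffusion and $K_G L_E/(1 + K_G^2 L_E^2)^{1/2}$ for drift, which are the forms consistent with the mobile charge modulation depths of Eqs. (3.25) and (3.26); the function names are hypothetical.

```python
import math

def diffusion_efficiency(kl):
    """Transport factor for diffusion-only transport, kl = K_G * L_D."""
    return kl ** 2 / (1.0 + kl ** 2)

def drift_efficiency(kl):
    """Transport factor for drift-only transport, kl = K_G * L_E."""
    return kl / math.sqrt(1.0 + kl ** 2)

# Tabulate both factors from short to long normalized transport lengths.
for kl in (0.1, 0.3, 1.0, 3.0, 10.0, 100.0):
    print(f"K_G*L = {kl:6.1f}  diffusion = {diffusion_efficiency(kl):.4f}  "
          f"drift = {drift_efficiency(kl):.4f}")
```

Both factors rise monotonically toward unity, so the baseline sinusoid value is approached but never exceeded, and for equal normalized transport length the drift factor is the larger of the two.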

The reason that the recording sensitivity only reaches the baseline sinusoid
level in this limit is best understood by considering the spatial modulation
profile of the mobile charge density $n(x)$. Analytical expressions for the
modulation depth of the mobile charge density can be readily derived from the
same analysis that led to Eqs. (3.21) and (3.22) (Young et al., 1974; Moharam
et al., 1979), for times sufficiently long compared with the mobile carrier
lifetime so that the mobile charge density represents a quasi-steady-state
distribution, and also sufficiently short so as to remain in the initial
recording regime. In the diffusion-only case, the mobile charge density is

$$n(x) = \tau g_0\left[1 + m_e(K_G L_D) \cos\!\left(\frac{2\pi x}{\Lambda_G}\right)\right], \qquad (3.24)$$

in which $\tau$ is the mobile carrier lifetime and in which the modulation
depth $m_e(K_G L_D)$ is given by

$$m_e(K_G L_D) = \frac{m}{1 + K_G^2 L_D^2}. \qquad (3.25)$$

Electron density modulation depth and charge transport efficiency as a
function of the diffusion-induced transport length, in the diffusion-only
regime.


Note that this charge distribution has the least efficient possible phase for
building up a space charge field. Recall from the discussion in Section 3.2.1
that the optimum phase of the mobile charge profile is a 180° phase shift
with respect to the incident illumination profile, allowing the space charge
field contribution of the mobile charge after subsequent trapping to add to
the contribution of the excess positively charged trap profile produced by
the photogeneration process. Here, however, the mobile charge profile is
aligned coincident with the illumination and with the excess positively
charged trap profiles. As a consequence, when a mobile charge is captured, it
removes one of the incremental photogenerated traps, thereby reducing the
overall space charge profile. A net space charge field can then accrue in the
diffusion-dominated transport case only by partially randomizing the mobile
charge distribution, corresponding to a reduction in the modulation depth
$m_e(K_G L_D)$. Note in Fig. 3.14 that the rise in charge transport
efficiency with increasing transport length $K_G L_D$ is coincident with a
reduction of the mobile charge modulation depth, as expected. Optimum charge
transport efficiency occurs when the mobile charge profile has been
completely randomized, which, according to Eq. (3.25), occurs in the limit
$L_D \gg \Lambda_G$.

[Horizontal axis: drift-induced transport length $L_E$, scaled to the grating
period $\Lambda_G$.] Electron density modulation depth and charge transport
efficiency as a function of the drift-induced transport length, in the
drift-only regime. The resultant phase shift of the mobile electron
distribution is shown at the top of the figure.
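
The connection between the mobile charge modulation depth and the transport factor of Eq. (3.21) can be sketched as a simple rate balance. The derivation below assumes, as a simplification consistent with the linearized initial-recording regime, that ionized donors are created at the local photogeneration rate and removed at the local trapping rate $n(x)/\tau$; the symbol $N_1$ for the modulated part of the net trapped charge number density is introduced here for illustration only.

```latex
% Photogeneration rate and quasi-steady-state mobile charge density:
g(x) = g_0\,[1 + m\cos(K_G x)], \qquad
n(x) = \tau g_0\,[1 + m_e\cos(K_G x)]

% Net growth rate of the modulated ionized-donor density
% (generation minus local retrapping at rate n/\tau):
\frac{\partial N_1}{\partial t}
  = g_0 m - \frac{\tau g_0 m_e}{\tau}
  = g_0\,(m - m_e)

% Inserting m_e = m / (1 + K_G^2 L_D^2) from Eq. (3.25):
\frac{\partial N_1}{\partial t}
  = g_0\, m\, \frac{K_G^2 L_D^2}{1 + K_G^2 L_D^2}
```

The final factor is the second bracketed term of Eq. (3.21): it vanishes when the mobile charge retains the full modulation of the illumination ($m_e = m$) and approaches unity only as the mobile charge becomes completely randomized ($m_e \to 0$).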

Similarly, for drift-dominated transport, the mobile charge profile has a
modulation depth of

$$m_e(K_G L_E) = \frac{m}{(1 + K_G^2 L_E^2)^{1/2}} \qquad (3.26)$$

and a phase shift of $\phi$, as given by Eq. (3.23). The charge transport
efficiency, mobile charge modulation depth, and phase shift of the mobile
charge profile with respect to the incident illumination are plotted in Fig.
3.15 as a function of the transport length $K_G L_E$ for drift-dominated
transport, which is in turn proportional to the applied bias field $E_0$. For
short transport lengths, the phase shift is close to 0°, which as pointed out
above is the least efficient phase for building a space charge field, and the
modulation depth $m_e$ exhibits its maximum value of $m$, the modulation
depth of the illumination profile. For longer transport lengths, the mobile
charge profile phase shifts away from 0°, but is always less than 90°, and
hence always degrades the net space charge profile. The modulation depth
$m_e$ similarly decreases with increasing charge transport length. The most
efficient charge transport occurs for very large transport lengths
$K_G L_E$, for which the mobile charge profile has become completely
randomized.

The reason that the phase of the mobile charge profile never exceeds a
90° phase shift for drift-dominated transport (and that the phase is always
0° for diffusion-dominated transport) can perhaps best be appreciated from
Fig. 3.13, which shows the mobile charge profile that is generated in
response to a narrow slit (i.e., very tightly focused) illumination profile. The
profiles shown in Fig. 3.13 can be derived by Fourier decomposing the Dirac
delta function of the illumination profile into an equivalent set of spatial
harmonics, applying Eqs. (3.25) and (3.26) to find the mobile charge profile
in response to each spatial frequency, and then performing an inverse Fourier
transform. But the profiles shown in Fig. 3.13 are also intuitively reasonable.
Diffusion tends to broaden the mobile charge density symmetrically
about the location of an illumination region, whereas drift tends to pull the
mobile charges to one side. In addition, note that the mobile charge distribution
resulting from any arbitrary illumination profile can be derived by
convolving the illumination profile with the appropriate distribution shown
in Fig. 3.13 (which can be considered to be a blur function). Because of
the monotonically decreasing shape of these blur functions, no phase shift
in excess of 90° is feasible.
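
The bound on the phase shift can be checked numerically. The sketch below is an
illustration under stated assumptions, not the calculation behind Fig. 3.13: it
assumes that the drift blur function is a one-sided decaying exponential and that
the diffusion blur function is a symmetric decaying exponential (the text specifies
only that the blur functions decrease monotonically), and it evaluates the response
of each to a single spatial harmonic. The function names and the argument
k_times_l (wavenumber times transport length) are hypothetical.

```python
import cmath
import math


def drift_blur_response(k_times_l):
    """Response of an assumed one-sided exponential blur, exp(-x/L)/L for x >= 0,
    to a sinusoid of wavenumber K. Its Fourier transform is 1 / (1 + i*K*L).
    Returns (relative modulation depth, phase lag in degrees)."""
    h = 1.0 / complex(1.0, k_times_l)
    return abs(h), -math.degrees(cmath.phase(h))


def diffusion_blur_response(k_times_l):
    """Response of an assumed symmetric blur, exp(-|x|/L)/(2L).
    Its Fourier transform, 1 / (1 + (K*L)**2), is real and positive,
    so the phase lag is zero for every transport length."""
    return 1.0 / (1.0 + k_times_l ** 2), 0.0


if __name__ == "__main__":
    for kl in (0.1, 1.0, 10.0, 100.0):
        depth, lag = drift_blur_response(kl)
        print(f"K*L = {kl:6.1f}  depth = {depth:.4f}  phase lag = {lag:6.2f} deg")
```

Under these assumptions the drift phase lag is arctan(K L), which approaches but
never reaches 90°, and the relative modulation depth falls as
1/sqrt(1 + (K L)^2), in line with the qualitative behavior described for Fig. 3.15.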

Thus we find that the baseline sinusoid model represents an upper bound
on the grating recording efficiency predicted by the single mobile charge
species/single donor/single trap photorefractive recording model.

3.3.2.  Temporal  Approach  to  Steady  State 

In addition to the charge transport efficiency factor just discussed, two
additional factors must be considered when evaluating any realistic recording
situation. One factor pertains to the saturation in temporal growth of the
space charge field, resulting in a reduced growth rate as the field approaches
its steady state limit. The second factor pertains to the modulation depth m
of the illumination profile, which is often chosen to be small to enable
enhanced recording techniques, as discussed in the next section.

The reduction in sensitivity of the recording process as the space charge
field approaches its steady state limit can be assessed from analytic solutions
that have been derived by Kukhtarev (1976). Typical solutions are shown
in Fig. 3.16 for a variety of bias fields, E_0, scaled to the maximum possible
field, E_q, due to limited ionized trap density (discussed in Section 3.2.2).
These curves were generated based upon typical charge mobility-lifetime
product (μτ) parameters for bismuth silicon oxide (Bi12SiO20, BSO), as given
in Table 3.3. (A bias field to saturation field ratio of 10:1 is unphysical for
several photorefractive materials such as bismuth silicon oxide, requiring
exceptionally high bias fields and/or spatial frequencies, but is nonetheless
included for generality.) Note that in all cases shown the time needed to
reach saturation is approximately a factor of two longer than that predicted
by the baseline sinusoid idealized transport model.

Fig. 3.16. Temporal approach of the first harmonic of the space charge field E_1 to its
steady state limit with applied field as a parameter for Bi12SiO20, assuming a
stationary illumination profile with small modulation depth m. (Horizontal axis:
grating recording time.)

Table 3.3. Material Parameters Assumed in the Calculations

Because the analytical solutions for temporal evolution derived
by Kukhtarev (1976) are valid only for small modulation depths m of the
illumination profile, such solutions describe low recording efficiency
situations in which the majority of the photons in the illumination beam
contribute a uniform photocurrent and only a small fraction of the photons
convey the spatial structure in the image profile. This point is emphasized in
Fig. 3.17, in which numerical solutions of the photorefractive equations are
presented, showing the temporal evolution of the first spatial harmonic
component of the space charge field for various modulation depths m. These
solutions have been produced by the authors using numerical techniques
discussed by Moharam et al. (1979) but applied to the full set of photorefractive
equations proposed by Kukhtarev (1976).

In Fig. 3.17, a crystal of bismuth silicon oxide illuminated by a 300 cycle/mm
sinusoid has been assumed. Note in this figure the slight oscillations
that can be observed in the temporal evolution. The strength of these
oscillations is directly dependent on the charge mobility-lifetime product, μτ;
the curves in Fig. 3.17 correspond to the material and configuration
parameters as listed in Table 3.3. It should be noted that a wide variation in
mobility-lifetime products has been reported, even for crystals with nominally
the same composition (Lesaux et al., 1986).

Note also in Fig. 3.17 that the highest space charge fields are associated
with the highest illumination profile modulation depths, a regime for which
the time-dependent analytical solutions derived by Kukhtarev are no longer
applicable. Furthermore, to first order the resultant space charge field E_1,
both in the initial recording regime as well as in saturation, is directly
proportional to the modulation depth parameter m.

Fig. 3.17. Increase in saturation space charge field with increasing modulation depth for
Bi12SiO20, showing transition from linear to nonlinear recording regimes.
(Horizontal axis: grating recording time, scaled to the dielectric relaxation time.)


3.3.3.  Enhanced  Recording  Techniques 

The recording configuration just considered assumes a stationary illumination
profile and a constant applied bias field (for brevity, hereinafter called
the stationary illumination technique). Space charge fields with much higher
steady state limits can be obtained by either of two alternative nonstationary
recording configurations. One technique is to translate the illumination
profile with respect to the photorefractive crystal (hereinafter referred to as the
running grating technique) (Huignard and Marrakchi, 1981; Stepanov et al.,
1982; Valley, 1984; Refregier et al., 1985), and the second technique is to
periodically reverse the direction of the applied bias field (the alternating
field technique) (Stepanov and Petrov, 1985). These techniques do not
improve the rate at which space charge builds up with time under constant
illumination intensity, compared with the stationary illumination/constant
field recording configuration.

The running grating technique, in particular, was chosen for study herein,
both to illustrate enhanced photorefractive recording concepts and because
it is commonly employed to provide significant amplification of weak
images. This technique can be studied by a combination of analytical solutions
that are valid in the linearized regime of very small modulation depths and
by numerical solutions for larger image modulations.

The relative advantage of the running grating technique is summarized in
Fig. 3.18, which has been derived from the analytical solutions of Valley
(1984) and Refregier et al. (1985) in the limit of small illumination profile
modulation depths. A grating spatial frequency of 110 cycles/mm and
parameters typical of bismuth silicon oxide have been assumed in the solutions
shown in Fig. 3.18. The spatial frequency is chosen to be 110 cycles/mm,
rather than 300 cycles/mm, because the enhancement of the steady state
space charge field is maximized for this grating frequency, assuming
material parameters for Bi12SiO20 as given in Table 3.3. Note that the steady
state limit of the space charge field is in fact increased by the running grating
technique relative to that obtained with a stationary grating, but that the
initial recording sensitivities are asymptotically equal. The initial growth of
the space charge field continues to be bounded by a combination of quantum
limitations and charge transport inefficiencies, as described above.
Furthermore, the additional factor of two reduction in sensitivity observed on
approach to saturation obtains for both solutions.

Fig. 3.18. Temporal evolution of the space charge field, comparing the cases of stationary
and running gratings for Bi12SiO20 in the low modulation limit. (Horizontal axis:
grating recording time, scaled to the dielectric relaxation time.)

The magnitude of the steady state space charge field is necessarily quite
small in the linearized regime for which the analytical solutions apply, as
the field is proportional to the modulation depth m of the illumination
profile, which must be kept small to ensure accurate analytical solutions. Larger
space charge fields require larger illumination modulation depths, which in
turn result in eventual nonlinear saturation of the space charge field itself.
Numerical solutions have been generated by the authors to explore the onset
of this field saturation, with typical results as shown in Fig. 3.19, again
assuming the nominal Bi12SiO20 material parameters listed in Table 3.3. Note
that the saturation of the steady state field occurs at quite modest values of
the modulation depth, which is consistent with the experimental observations
reported by Refregier et al. (1985). Thus the running grating technique proves
to be most effective for amplifying weak images, but not for recording large
space charge fields. For recording the largest fields, stationary illumination
is preferred.

Fig. 3.19. Steady state space charge field as a function of modulation depth for Bi12SiO20,
comparing the cases of stationary and running gratings. (Horizontal axis:
illumination profile modulation depth.)




4. Representative Grating Recording Efficiency Calculations

In order to illustrate the above concepts, we proceed in this section to
consider the several factors that contribute to the overall grating recording
efficiency for two different types of materials and for two different types of
applications. The materials considered are bismuth silicon oxide (Bi12SiO20,
or BSO) and barium titanate (BaTiO3); the principal material parameters
assumed in the estimates are listed in Table 3.3, based upon a set of values
utilized in previous related analyses by Valley and Klein (1983). The two
applications considered are those of reconfigurable holographic
interconnections and the amplification of weak images.

Let us first consider a reconfigurable holographic interconnection
implemented in bismuth silicon oxide. For simplicity, we assume that only one
grating with a spatial frequency of 300 cycles/mm is recorded in the crystal.
To achieve maximum diffraction efficiency, the modulation depth of the
recording beams should be as large as possible; ideally, m = 1. Also, a bias
electric field is typically applied to crystals of Bi12SiO20 to enhance the
photosensitivity; a field of 6 kV/cm is typical, implying a drift transport length
L_E from Eq. (3.18) of 9 μm, assuming the mobility-lifetime product given
in Table 3.3. For a 300 cycle/mm grating frequency, this implies a ratio of
drift transport length to grating period (L_E/Λ_G) of about 2.7, and from
Fig. 3.15 we see that this corresponds to
essentially 100% charge transport efficiency in the initial stages of recording.
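
The transport-length ratio quoted above follows from simple arithmetic, sketched
below. The 9 μm drift length, 6 kV/cm bias field, and 300 cycles/mm spatial
frequency are taken from the text. The relation L_E = μτ E_0 used to back out an
implied mobility-lifetime product is an assumption about the form of Eq. (3.18),
which is not reproduced in this section, and the function names are illustrative only.

```python
def grating_period_um(cycles_per_mm):
    """Grating period in micrometers for a spatial frequency in cycles/mm."""
    return 1000.0 / cycles_per_mm


def transport_ratio(drift_length_um, cycles_per_mm):
    """Drift transport length divided by the grating period (dimensionless)."""
    return drift_length_um / grating_period_um(cycles_per_mm)


def implied_mu_tau_cm2_per_volt(drift_length_um, bias_field_kv_per_cm):
    """Mobility-lifetime product implied by L_E = mu*tau*E_0 (assumed form)."""
    drift_length_cm = drift_length_um * 1.0e-4
    bias_field_v_per_cm = bias_field_kv_per_cm * 1.0e3
    return drift_length_cm / bias_field_v_per_cm


if __name__ == "__main__":
    print(f"grating period: {grating_period_um(300):.2f} um")
    print(f"L_E / period:   {transport_ratio(9.0, 300):.1f}")
    print(f"implied mu*tau: {implied_mu_tau_cm2_per_volt(9.0, 6.0):.2e} cm^2/V")
```

The ratio evaluates to about 2.7, matching the value quoted in the text; the implied
mobility-lifetime product should be checked against Table 3.3 before being relied upon.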

The recording efficiency for a Bi12SiO20 interconnect, compared with the
ideal (i.e., bipolar comb) quantum efficiency limit, is listed in Table 3.4.
Three factors are considered in this and subsequent tables. The first is the
25% efficiency factor that applies between the baseline sinusoid and bipolar
comb cases, a factor that is common to all recording configurations
considered in this section. The second factor is the charge transport efficiency,
comparing actual recording performance to that of the baseline sinusoid case.
The Bi12SiO20 interconnect is assigned a 50% charge transport efficiency to
account for the factor of two increase in recording energy (photon flux)
estimated to reach saturation, as shown in Fig. 3.16 and as discussed in
Section 3.3.2. The final factor is the modulation depth, which we have
assumed to be unity for a reconfigurable interconnect. Thus the total grating
recording efficiency, in terms of the magnitude of the space charge field
generated per unit photon flux, compared with ideal quantum efficient
recording, is only about 12.5% for a Bi12SiO20 interconnection. This gives a
diffraction efficiency derating factor of only 1.6% (assuming a diffraction
efficiency that is proportional to the square of the space charge field).

Table 3.4

Representative Grating Recording Efficiency Calculation: Reconfigurable Interconnection
Drift Recording in Bismuth Silicon Oxide

Modulation depth = 1.0
300 cycles/mm grating frequency

Space charge efficiency factors:
    Baseline sinusoid/bipolar comb           0.25
    Charge transport (to saturation)         0.5
    Modulation depth factor                  1.0
Grating recording efficiency                 0.125
Diffraction efficiency derating factor       0.016
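
The bottom-line entries in Tables 3.4 through 3.6 are products of the listed factors,
and the derating factor is the square of that product. The following sketch
reproduces the arithmetic using the factor values given in the tables; the function
names and the CASES dictionary are illustrative, and the square-law relation is the
assumption stated in the text.

```python
def grating_recording_efficiency(baseline_factor, transport_factor, modulation_depth):
    """Product of the three space charge efficiency factors."""
    return baseline_factor * transport_factor * modulation_depth


def diffraction_derating(recording_efficiency):
    """Derating factor, assuming diffraction efficiency scales as field squared."""
    return recording_efficiency ** 2


# Factor values as listed in Tables 3.4, 3.5, and 3.6.
CASES = {
    "Table 3.4 (BSO interconnect, drift)": (0.25, 0.5, 1.0),
    "Table 3.5 (BSO amplification, running grating)": (0.25, 0.5, 0.1),
    "Table 3.6 (BaTiO3 interconnect, diffusion)": (0.25, 0.002, 1.0),
}

if __name__ == "__main__":
    for label, factors in CASES.items():
        eff = grating_recording_efficiency(*factors)
        print(f"{label}: efficiency = {eff:.4g}, "
              f"derating = {diffraction_derating(eff):.2g}")
```

Keeping the factors separate in this way makes it easy to see which one dominates in
each configuration: the modulation depth for running grating amplification, and the
charge transport factor for diffusion recording in barium titanate.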

As a second example, consider image amplification in Bi12SiO20 using a
running grating enhanced recording technique. We will again assume a 110
cycle/mm grating spatial frequency (see Section 3.3.3) and a bias field of
6 kV/cm applied to the Bi12SiO20 crystal. However, the modulation depth
must be reduced from 100% to approximately 10% to achieve the peak space
charge enhancement in saturation provided by the running grating recording
technique, as shown in Fig. 3.19 and as discussed in Section 3.3.3. For our
calculations, we chose a modulation depth equal to 10%, implying a
corresponding reduction in the quantum efficiency of the recording process;
i.e., most of the photons must supply the pump beam, with comparatively
few photons in the signal beam containing signal information. Thus the total
quantum efficiency for image amplification, shown in Table 3.5, is an order
of magnitude lower than that given in Table 3.4 for the reconfigurable
interconnection. Correspondingly, the diffraction efficiency derating factor is
two orders of magnitude lower for this case than for the case of the
reconfigurable interconnection and nearly four orders of magnitude less efficient
than the absolute quantum limitation.

Table 3.5

Representative Grating Recording Efficiency Calculation: Two-Wave Image Amplification
Running Gratings in Bismuth Silicon Oxide

Modulation depth = 0.1
110 cycles/mm grating frequency

Space charge efficiency factors:
    Baseline sinusoid/bipolar comb           0.25
    Charge transport (to saturation)         0.5
    Modulation depth factor                  0.1
Grating recording efficiency                 0.0125
Diffraction efficiency derating factor       0.00016

As a final example, consider a reconfigurable interconnection in barium
titanate, with grating recording efficiency as shown in Table 3.6 for the case
of recording by diffusion transport only. Because of the large electrooptic
coefficient in barium titanate, only very modest space charge fields,
typically a fraction of the diffusion field for a 300 cycle/mm grating, are needed
to achieve peak diffraction efficiencies in crystals of reasonable size. Hence
diffusion transport often proves to be sufficient in this material in order to
generate experimentally useful diffraction efficiencies. Unfortunately, two
factors work against the high electrooptic coefficient in barium titanate. One
is the concomitantly high dielectric constant, which from Maxwell's first
equation implies that a considerable amount of space charge must be moved
to achieve a modest space charge field. As mentioned in the introduction,
the combined material parameter (n^3 r_eff/ε), which is a measure of the amount
of optical index modulation per unit space charge, proves to be surprisingly
constant from material to material (Glass et al., 1984; Glass, 1984). Based
upon this measure alone, barium titanate proves to be modestly superior to
bismuth silicon oxide, as shown in Table 3.3.

The second, and far more serious, factor degrading grating recording
efficiency for diffusion recording in barium titanate is its low mobility-lifetime
product μτ, which is almost two orders of magnitude lower than that for
bismuth silicon oxide, implying a significantly degraded charge transport
efficiency. The diffusion transport length L_D predicted by Eq. (3.19) is of
order 35 nm, based upon the mobility-lifetime product μτ given in Table
3.3. Assuming a 300 cycle/mm grating, this implies by Eq. (3.21) a charge
transport efficiency factor of order 0.004. We include an additional factor
of 0.5 in Table 3.6 to account for the additional photon flux required to
reach saturation, as indicated in Fig. 3.16 and by the discussion in Section
3.3.2. Thus we find that the grating recording efficiency for diffusion
recording in barium titanate is some 250 times less than that for drift recording
in bismuth silicon oxide, in terms of space charge generated per unit incident
photon.

Table 3.6

Representative Grating Recording Efficiency Calculation: Reconfigurable Interconnect
Diffusion Recording in Barium Titanate

Modulation depth = 1.0
300 cycles/mm grating frequency

Space charge efficiency factors:
    Baseline sinusoid/bipolar comb           0.25
    Charge transport (to saturation)         0.002
    Modulation depth factor                  1.0
Grating recording efficiency                 0.0005
Diffraction efficiency derating factor       2.5 x 10^-7
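
The order of magnitude of the charge transport factor quoted above can be
reproduced with a short calculation. The sketch below assumes that the diffusion
transport efficiency takes the form (K L_D)^2 / (1 + (K L_D)^2), with K equal to
2π times the spatial frequency; this functional form is an assumption standing in
for Eq. (3.21), which is not reproduced in this section. The 35 nm diffusion length
and 300 cycles/mm frequency are taken from the text, and the function names are
hypothetical.

```python
import math


def grating_wavenumber_per_m(cycles_per_mm):
    """Grating wavenumber K = 2*pi*f, converted from cycles/mm to rad/m."""
    return 2.0 * math.pi * cycles_per_mm * 1.0e3


def diffusion_transport_factor(diffusion_length_m, cycles_per_mm):
    """Assumed diffusion transport efficiency, (K*L_D)**2 / (1 + (K*L_D)**2)."""
    kl = grating_wavenumber_per_m(cycles_per_mm) * diffusion_length_m
    return kl ** 2 / (1.0 + kl ** 2)


if __name__ == "__main__":
    factor = diffusion_transport_factor(35.0e-9, 300)
    print(f"transport factor:          {factor:.4f}")
    print(f"with 0.5 saturation factor: {0.5 * factor:.4f}")
```

With these inputs the assumed expression gives a factor slightly above 0.004, and
roughly 0.002 after the additional factor of 0.5, consistent with the charge
transport entry in Table 3.6.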

Thus we have considered representative examples of several different
materials, transport mechanisms and efficiencies, and recording applications.
We find that, even in the most efficient grating recording configurations, a
significant inefficiency still exists between the absolute quantum limits and
actual performance.


5. Conclusions

In this chapter, we have considered a number of the fundamental physical
limitations that constrain the potential performance of photorefractive
materials. In particular, we have described several idealized photogeneration
and charge transport models in terms of the grating recording efficiency,
and we have identified one such model (the bipolar comb) as the absolute
quantum limit against which other such models may be compared. The
bipolar comb model generates the maximum possible fundamental harmonic
of the space charge field at unity dielectric constant for a given number of
photoexcited mobile charge carriers. A second idealized model, the baseline
sinusoid, provides an upper bound for the grating recording efficiency of a
more realistic photorefractive grating recording model involving a single mobile
charge carrier and a single donor/single trap photorefractive center. This
upper bound was shown to be a factor of four less efficient in generating a
given space charge field than the quantum limitations imply for an optimum
photogeneration distribution and perfectly efficient charge transport. An
additional factor of two accrues from the nonlinearity of the grating recording
process observed as the space charge field nears saturation. The combination
of absorption and reflection losses in typical photorefractive recording
configurations (without antireflection coatings) contributes approximately one
order of magnitude to the inefficiency of grating recording and readout
relative to the incident photon flux. Hence, the usual photorefractive recording
configuration exhibits an overall sensitivity that is approximately two orders
of magnitude less than that achievable in the quantum limit. This results in
roughly four orders of magnitude reduction in the corresponding diffraction
efficiency per incident photon. These considerations explain to a certain
degree why photorefractive recording has proven to be relatively insensitive
as compared with distinct but related mechanisms of spatial light modulation.
Perhaps far more important, however, are the implications of the above
analysis for the conceptual design and technological implementation of
optimized grating recording media that operate far closer to the quantum limits.
For example, the bipolar comb illumination profile is not necessarily as
unphysical as it may at first seem, and it can be closely approximated by the
utilization of stratified volume holographic optical elements (Johnson and
Tanguay, 1988) for beam formation. This approach is perhaps most
appropriate for the generation of a grating of given spatial frequency, as in the
photorefractive incoherent-to-coherent optical converter (Marrakchi et al.,
1985). As a second example, the factor of two inherent in the approach to
saturation can be avoided by seeking photorefractive materials of near unity
charge transport efficiency and high electrooptic figures of merit, such that
experimentally suitable diffraction efficiencies can be obtained well within
the linear recording regime. As a final example, the inefficiency implied by
surface reflection losses can be dramatically reduced, even over the broad
spectrum of angles and wavelengths characteristic of photorefractive
recording, by the design and incorporation of appropriate antireflection
coatings (Karim et al., 1988).
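
The loss budget summarized in this section can be tallied in a few lines. The sketch
below multiplies the factors named in the text (a factor of four between the baseline
sinusoid and the bipolar comb, a factor of two on approach to saturation, and roughly
one order of magnitude from absorption and reflection losses) and squares the result,
assuming as elsewhere in the chapter that diffraction efficiency scales as the square
of the space charge field. The factor of ten is the text's approximate figure rather
than a measured value, and the names used are illustrative.

```python
import math

# Loss factors named in the conclusions; the last entry is approximate.
LOSS_FACTORS = {
    "baseline sinusoid vs. bipolar comb": 4.0,
    "nonlinearity near saturation": 2.0,
    "absorption and reflection losses (approximate)": 10.0,
}


def total_sensitivity_loss(factors):
    """Overall sensitivity loss as the product of the individual factors."""
    total = 1.0
    for value in factors.values():
        total *= value
    return total


def orders_of_magnitude(x):
    """Base-10 logarithm, i.e. the number of orders of magnitude in x."""
    return math.log10(x)


if __name__ == "__main__":
    loss = total_sensitivity_loss(LOSS_FACTORS)
    print(f"sensitivity loss: {loss:.0f}x "
          f"({orders_of_magnitude(loss):.1f} orders of magnitude)")
    print(f"diffraction efficiency loss: {loss ** 2:.0f}x "
          f"({orders_of_magnitude(loss ** 2):.1f} orders of magnitude)")
```

The product is a factor of 80 in sensitivity and 6400 in diffraction efficiency,
which is the basis for the "approximately two" and "roughly four" orders of
magnitude quoted above.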
PHOTONIC IMPLEMENTATIONS OF NEURAL NETWORKS


B.  Keith  Jenkins  and  Armand  R.  Tanguay,  Jr. 


TOWARDS THE DEVELOPMENT OF A NEURAL NETWORK IMPLEMENTATION TECHNOLOGY


As described in other chapters of this book, neural networks provide a different
approach to solving problems as compared with more conventional algorithmic techniques,
and can be applied to a wide range of applications. In some of these application domains
the simulation performance of neural networks has been comparable to that of more
conventional algorithmic approaches. In many application domains, however, a realistic (and
therefore large scale) problem may overwhelm the conventional approach, in that it may
be too computation intensive to be implemented on a sequential digital computer, and
may not parallelize sufficiently well (if at all) for efficient computation on a parallel digital
machine. On the other hand, because a neural network algorithm is inherently parallel, it
immediately suggests a parallel architecture, which may in turn be implemented using
either analog or digital hardware. And for the case of large-scale problems, analog hardware
will typically provide a much more efficient neural implementation than digital hardware.

In the previous chapter, fully electronic (primarily VLSI-based) neural networks were
described in which the primary functionality of both the neuron units and the weighted
(synaptic) interconnection matrix is incorporated on a planar microelectronic chip. An
important advantage of the integrated circuit approach to neural network implementation
is the capability for near-term technology insertion, with leverage provided by a
well-established technology base characterized by a fully developed computer aided design and
computer aided manufacturing (CAD/CAM) device and circuit repertoire. An equally
important limitation is the difficulty in scaling up neural chips to incorporate large numbers
of neuron units in fully (or near fully) interconnected architectures. This limitation derives
from the limited pin-out, off-chip communication bandwidth, and on-chip interconnection
density available in both current generation and projected chip designs.

In this chapter, we consider the utilization of optical (free-space) interconnection
techniques in conjunction with photonic switching and modulating devices to expand the
number of neuron units and complexity of interconnection, by using the off-chip (3rd) dimension
for synaptic communication. As we shall see, the merging of optical and photonic devices
with appropriately matched electronic circuitry can provide novel features such as
fully parallel weight updates and modular scalability, as well as both short and long term
synaptic plasticity.

Many approaches to the incorporation of photonic and optical technology in the
implementation of neural networks are currently being pursued in the research community.
The intent of this chapter is not to present a review of these various approaches, the
details of which can be obtained from several of the references given in the Suggested
Further Reading section at the end of this chapter. Instead, our focus herein is directed
toward a description of key photonic devices and techniques based on fundamental optical
phenomena, as well as toward a unique and generalizable approach to their potential use
in the implementation of large-scale, highly parallel neural network architectures. The
unusual nature of some of these techniques has interesting implications for the design of
naturally mapped architectures and associated learning/computing algorithms. We will
address a number of these unique features in the context of a description of the basic
optical phenomena that can be used to advantage, and of the array of photonic devices that
comprise the system designer’s palette. Because the incorporation of optical and photonic
hardware casts the subject of neural network implementations in a somewhat unfamiliar
light, we first discuss a set of desirable and requisite characteristics for neural network
implementation technologies.

An important feature of any implementation technology is that of generality: a “building
block” approach. The growth and synthesis of material structures, and their
incorporation into devices, must be well characterized, understood, and repeatable, for a small
number of specific material combinations and device structures. These devices are then
assembled into circuits or architectures for the implementation of specific computational
models. This provides leverage in two ways: (1) the small number of useful and well
understood components are used repeatedly in different structures for different applications,
and (2) architecture and system level designers need not be experts in the properties of the
material and device structures used to configure the components, saving many man-hours
in the design of computational systems over a completely custom approach. Such a purely
custom approach could preclude the widespread use of these architectures and systems, as
has been characteristic, for example, of optical information processing and optical signal
processing systems over the past two decades.

Not  only  is  it  important  for  the  implementation  technology  to  be  of  a  building  block 
nature,  but  it  is  also  important  for  the  models  underlying  the  computational  architectures 
to  support  such  an  approach,  and  in  fact  to  be  of  a  building  block  nature  themselves. 
Ideally  the  models  would  comprise  a  set  of  common  components  and  operations  at  the 
functional  level,  such  as  specific  types  of  neuron  units,  weighted  interconnections,  weight 
updates,  and  comparisons  with  desired  target  values.  Then  the  mapping  from  model  and 
functional  architecture  to  hardware  architecture,  layout,  and  implementation  can  proceed 
efficiently  as  well.  This  building  block  approach  at  both  the  model  and  hardware  levels 
has  certainly  been  characteristic  of  the  development  of  digital  electronics,  and  has  been 
largely  responsible  for  its  success. 

Assuming that appropriate neural network models and a corresponding technology
base can be merged within a compatible building block approach, neural network systems
potentially provide a unique capability for large-scale analog, nonlinear computation.
As such, the neural network paradigm potentially alleviates two critical bottlenecks that
have impeded the widespread implementation of large-scale analog, nonlinear computing
systems based on non-neural architectures: the lack of appropriate generic hardware
components and of the sufficiently leveraged manpower required for their economical design
and manufacture, as well as difficulty in establishing efficient techniques for mapping from
the application and model domain onto compatible hardware. With regard to the
former bottleneck, neural network architectures are generally forgiving with respect to device
nonuniformities and imperfections, creating much needed latitude for the device designer.
With regard to the latter bottleneck, the neural network paradigm inherently provides a
mapping from the problem domain onto a highly parallel architecture, which immediately
yields a starting point for its layout in analog hardware.

In  the  case  of  photonics  for  neural  network  applications,  the  hardware  technology  is 
being  developed  simultaneously  with  the  neural  computation  model(s).  This  implies  at 
least two things. First, it is crucial to retain flexibility in the functionality of each component,
so that as the neural computation models evolve, the hardware can evolve along
with them. The development of an entirely new technology base typically takes at least a
decade; such a delay between model development and hardware realization is generally
unacceptable. Thus, the generic technology base must provide sufficient flexibility. Second,
not only should the neural computation model steer the technology development, but the
reverse can, and indeed must, also occur. This assures a mutual compatibility in outcome.

The basic requisite functions for a neural network technology base appear to be: neuron
unit response, weighted interconnections (fixed and variable), input/output, learning
computation and weight update, and duplication capability (i.e., the capability of making
a  copy  of  a  network  structure).  In  addition,  other  features  are  desirable,  such  as  higher 
order  connection  capability.  The  neuron  unit  response,  at  the  most  common  and  basic 
level,  is  a  sum-of-inputs  followed  by  a  monotonic  nonlinearity.  The  nonlinearity  should 
have  the  flexibility  of  providing  different  amounts  of  gain;  for  example,  it  is  useful  to  have 
a  high  gain  to  implement  a  binary  threshold,  and  a  low  gain  to  implement  a  nearly  linear 
response.  The  neuron  unit  should  be  bipolar  in  that  it  permits  both  positive  and  negative 
inputs,  so  that  inhibitory  and  excitatory  connections  can  be  realized.  This  we  consider 
to  be  the  minimal  requisite  functionality  of  a  neuron  unit.  In  addition,  a  very  desirable 
feature  is  the  capability  for  bipolar  outputs.  Note  that  biologically  this  is  not  necessarily 
the case, but useful neural computation models will likely deviate substantially from biological
reality and may require this capability. Other capabilities are also desirable, such
as leaky integrator effects [Mead, 1989] and more complex behavior such as that required
in shunting networks [Carpenter, 1987].
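
As an illustration of the minimal neuron unit described above, the following Python sketch computes a weighted sum of bipolar inputs followed by a monotonic nonlinearity with adjustable gain. The choice of tanh as the nonlinearity, and the names neuron_response and gain, are assumptions made for this example only; the text does not prescribe a particular function.

```python
import math

def neuron_response(inputs, weights, gain=1.0):
    """Sum-of-inputs followed by a monotonic nonlinearity (tanh is assumed here).

    Positive weights act as excitatory connections, negative weights as
    inhibitory ones. The output is bipolar, lying in (-1, 1).
    """
    activation = sum(w * x for w, x in zip(weights, inputs))
    return math.tanh(gain * activation)

inputs = [0.5, -0.2, 0.1]
weights = [1.0, 0.5, -1.0]          # the last connection is inhibitory

# Low gain: output is nearly proportional to the activation (about 0.3 here).
low_gain_output = neuron_response(inputs, weights, gain=0.1)

# High gain: output saturates, approximating a binary threshold.
high_gain_output = neuron_response(inputs, weights, gain=100.0)

print(low_gain_output, high_gain_output)
```

With a small gain the unit behaves almost linearly, and with a large gain it approaches a hard threshold, which is the flexibility the text asks for.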

Each  weighted  interconnection  must  store  a  learned  or  initialized  value,  and  perform 
a  multiplication  operation  on  the  signals  passing  through.  An  analog  multiplication  is 
generally  much  more  efficient  than  a  digital  one  for  reasons  of  speed  and  device  area 
or  volume.  For  this  reason,  it  is  worth  some  effort  to  provide  analog  storage  for  the 
weights.  Note  that  the  very  large  number  of  weights  used  in  many  networks  implies 
that  minimization  of  the  incorporated  hardware  complexity  of  the  requisite  storage  and 
multiplication  operations  is  crucial  for  physical  realizability.  Input/output  is  often  ignored 
at  the  higher  levels,  but  can  critically  affect  the  physical  architecture  and  can  be  a  major 
factor in determining the overall throughput. The input/output function includes the
input  of  signals,  the  output  of  results,  and  the  input  of  weights  if  necessary. 

It is important at the outset to distinguish among systems that have fixed weights,
systems  that  have  programmable  weights  (that  are  externally  computed  but  loaded  into 
the  network),  and  systems  that  have  full  learning  capability,  in  which  the  learning  algorithm 
is  implemented  as  part  of  the  parallel  system.  The  associated  hardware  complexity  can 
be  quite  different  in  each  of  these  cases.  Finally,  duplication  capability  is  useful,  for 
example,  for  replicating  a  pre-learned  network  when  multiple  copies  of  the  network  are  to 
be  produced  and  subsequently  used  with  fixed  connections.  Probing  the  weight  values  in  a 
hardware  implementation  may  at  first  sound  straightforward,  but  implementing  very  large 
numbers  of  weights  in  a  small  volume  can  in  some  cases  preclude  such  capability. 

Applications of hardware implementations of neural networks could include sensor signal
processing and fusion, pattern recognition, associative memory, and robotic control.
These applications imply a wide range of hardware requirements. For example, most vision
processing is characterized by moderately large numbers of neuron units, with small
to  moderate  connectivity  and  primarily  local  interconnections.  Associative  memory,  on 
the  other  hand,  typically  requires  a  very  high  connectivity. 

Semiconductor-based VLSI technology has proven to be very capable for the implementation
of most, if not all, of the above functions. It also has the capability for integration of
control circuitry and/or arbitrary digital or logical operations on the same chip as the neural
processing circuitry. However, an important issue in neural implementations is that of
scalability, since many neural network applications are likely to require very large numbers
of  neuron  units  and  connections.  As  pointed  out  above,  it  is  primarily  the  consideration 
of  scalability  that  leads  us  to  the  conclusion  that  purely  electronic  VLSI  will  work  well  for 
certain  applications,  but  can  benefit  greatly  from  the  incorporation  of  photonics  for  other 
applications. 

Two  important  considerations  in  VLSI  implementations  are  area  complexity  and  pinout 
requirements. A fully connected network of N neuron units requires area $O(N^2)$. One
can  think  of  this  as  having  only  a  single  linear  dimension  available  for  the  neuron  units 
themselves,  in  order  to  leave  room  for  the  connections.  This  of  course  limits  the  size 
of  fully  connected  networks  that  can  be  accommodated  on  a  single  chip.  For  example, 
chips  have  been  fabricated  with  54  neuron  units  and  2916  ternary  (3-level)  synapses,  and 
it is estimated that approximately 700 neurons, fully connected with ($10^5$ to $10^6$) similar
synapses, could be implemented on a CMOS chip using 0.5 µm design rules. Learning
capability with analog synapses may require substantially more area per synapse [Jackel,
1988]. On the other hand, networks with low connectivity and only local connections
between neuron units permit a much larger number of neuron units to be implemented on
a chip. For example, the silicon retina of Mead et al. comprises a 48 x 48 array of neuron-like
units, each connected to its 6 nearest neighbors on a hexagonal grid [Mead, 1988]. A
much larger number of neuron units, on the order of $10^5$, could be implemented with such
a locally-connected array, since the neuron units and the connections each require area
only $O(N)$. So we see that a critical limiting factor in the VLSI implementation of neural
networks  is  the  area  required  for  on-chip  interconnections,  and  that  the  area  required  for 
neuron  units  is  relatively  inconsequential. 
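
The scaling argument above can be checked with a few lines of arithmetic. The Python sketch below counts synapses for the fully connected and locally connected cases quoted in the text; the function names are hypothetical, and the local count is an upper bound because units at the edge of the array have fewer than 6 neighbors.

```python
def fully_connected_synapses(num_neurons):
    """Every neuron unit connects to every neuron unit: N^2 synapses."""
    return num_neurons * num_neurons

def locally_connected_synapses(num_neurons, neighbors_per_neuron):
    """Each neuron unit connects only to a fixed number of neighbors: O(N)."""
    return num_neurons * neighbors_per_neuron

# The 54-unit chip quoted in the text.
print(fully_connected_synapses(54))              # 2916

# The estimated 700-unit CMOS chip: falls between 10^5 and 10^6.
print(fully_connected_synapses(700))             # 490000

# A 48 x 48 array with 6 nearest neighbors per unit (upper bound).
print(locally_connected_synapses(48 * 48, 6))    # 13824
```

The quadratic growth of the first count, against the linear growth of the second, is what makes interconnection area rather than neuron area the limiting factor.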

The  number  of  pinouts  that  can  be  provided  on  a  chip  is  proportional  to  its  linear 
dimension,  not  to  its  area.  This  degree  of  pinout  capacity  is  well  matched  to  fully  connected 
networks  in  which  the  weights  do  not  need  to  be  input  or  output  frequently,  and  to  locally 
connected  networks  of  low  to  moderate  bandwidth.  An  example  of  the  latter  is  a  vision 
network that operates at video frame rates; in this case the signal can be easily time
multiplexed  onto  a  relatively  small  number  of  lines  for  communication  onto  and  off  of  the 
chip. 

On  the  other  hand,  other  application  areas  will  require  a  higher  input/output  (I/O) 
bandwidth  and/or  a  large  number  of  neuron  units  with  high  connectivity.  For  example, 
if  the  weights  are  to  be  fed  onto  and  off  of  a  chip  frequently,  substantial  multiplexing 
would be required for reasonably large networks (e.g., up to 200 pinouts can generally
be accommodated, but as described above, as many as $10^5$ to $10^6$ synaptic weights can be
incorporated  on  the  chip).  Wafer  scale  integration  with  bump  bonding  techniques  can 
help  by  providing  large,  multi-wafer  structures,  but  the  number  of  I/O  lines  is  still  likely 
to  be  modest  due  to  practical  and  physical  constraints.  So  we  see  that  a  second  critical 
limiting factor in the VLSI (and wafer scale integration) implementation of neural networks
is the number of I/O lines that can be practically incorporated. Neural applications
utilizing large fully connected networks such as associative memory would benefit greatly
from implementations of $10^5$ to $10^6$ fully connected neuron units, implying the need for
$10^{10}$ to $10^{12}$ synaptic weights. In addition, the intermediate realm of large networks with
partial but moderate-to-large connectivity will likely also prove beneficial to a wide range
of  applications. 
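
To give a feel for the I/O bottleneck, the following Python sketch estimates how many sequential transfers are needed to move all of the weights across the available pins. It assumes, purely for illustration, that each pin carries one weight value per transfer cycle; the function names are hypothetical.

```python
import math

def load_cycles(num_weights, num_pins=200):
    """Sequential transfers needed to move every weight across the pins once,
    assuming one weight value per pin per cycle (an illustrative assumption)."""
    return math.ceil(num_weights / num_pins)

def weights_for_full_connectivity(num_neurons):
    """Synaptic weights needed to fully connect num_neurons neuron units."""
    return num_neurons ** 2

print(load_cycles(10**5))                        # 500
print(load_cycles(10**6))                        # 5000
print(weights_for_full_connectivity(10**5))      # 10000000000
print(weights_for_full_connectivity(10**6))      # 1000000000000
```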

In this chapter, we discuss a variety of issues that impact the development of photonic
technology as applied to hardware implementations of neural networks with enhanced
capabilities. Photonics has the potential for the implementation of networks with large
numbers of neurons ($10^5$ to $10^6$) and high connectivity (approximately $10^{10}$ analog-weighted
interconnections) in one “module”. The approach taken here is to use electronics to implement
the internal function of each neuron unit, and to use optics to implement the
connections,  weights,  and  I/O.  With  this  technique  most  of  the  area  of  a  two-dimensional 
(2-D)  “chip”  can  be  used  to  implement  the  neuron  units  themselves,  and  optical  free-space 
propagation  and  volume  holograms  can  be  used  to  implement  the  interconnections.  Thus 
the interconnections actually occupy a three-dimensional (3-D) volume, which improves
scalability  dramatically. 

The  next  section  of  this  chapter  describes  the  fundamental  optical  principles  and  key 
photonic technology concepts that are needed for neural network implementations, and
covers photonic analog arithmetic, switching, interconnections, sources and detectors. Architectural
considerations are discussed in the subsequent section, including the use of
volume  interconnections,  signal  representation,  and  desired  architectural  features.  The 
next  section  then  presents  a  photonic  implementation  strategy  that  satisfies  most  of  the 
desired  criteria.  In  the  two  concluding  sections  we  investigate  the  ultimate  limitations  of 
photonic implementations of neural networks, and consider the future of such implementations.

FUNDAMENTAL  PRINCIPLES  OF 
PHOTONIC  TECHNOLOGY 


In  order  to  effectively  appreciate  the  potential  advantages  as  well  as  the  limitations  of 
extending the VLSI (electronic) repertoire to include photonic components and optically
inspired functionality, we will first identify and then explain a few truly fundamental
principles of the optical and photonic technologies on which such hybrid neural network
implementations are based. In this section, therefore, we discuss the basic features of optical
analog computation with both coherent and incoherent illumination sources, photonic
switching devices and their various neuron-like functions, the characteristics of photonic interconnections
that are essential to the implementation of synapse-like interneuron wiring,
and  the  principal  features  of  sources  (photonic  power  supplies)  and  detectors  (photonic-to- 
electronic  power  converters). 


Optical  Analog  Computation 

At  the  present  time,  most  proposed  photonic  implementations  of  neural  networks  are 
based  on  analog  operations,  both  in  the  representation  of  neuron  outputs  as  well  as  in 
the incorporation of interconnection weights. This emphasis on analog computation over
perhaps more familiar digital computation derives principally from several distinct advantages
that accrue to optical systems designed to handle the switching and interconnection
of very large numbers of inputs and outputs at each circuit node (neuron unit). These advantages
include a significant reduction in the number of switching components required to
sum multiple inputs [Abu-Mostafa, 1989], an increase in the degree of fan-in and fan-out
allowable from each circuit node, the elimination of analog-to-digital and digital-to-analog
converters, a significant decrease in signal routing and interconnection complexity, the potential
utilization of natural physical phenomena within certain photonic devices to accomplish
difficult computational functions directly, and the possibility of higher computational
throughput per unit dissipated energy in operations characterized by high computational
complexity. Additionally, two notable disadvantages of analog systems, error accumulation
and lack of precision in representation, may prove to be relatively unimportant in the
neural network environment, due in part to the self-organizing and error correcting nature
of many neural learning algorithms [von der Malsburg, 1987]. We will return to a number
of  these  issues  throughout  the  remainder  of  this  chapter. 

The  computational  operations  necessary  for  the  implementation  of  a  wide  range  of 
neural  and  neural-like  networks  are  surprisingly  simple,  consisting  primarily  of  addition, 
subtraction,  multiplication,  and  nonlinear  thresholding.  The  operation  of  addition  usually 
must  be  performed  over  a  very  large  number  of  inputs  at  a  given  neuron  unit,  representing 
weighted excitatory signals from other interconnected neuron units; subtraction is utilized
to differentiate excitatory inputs from inhibitory inputs to a given neuron unit, as is
common, for example, in models of the visual process and of associative memories. Multiplication
is necessary for the provision of linear interconnections with signal-independent
weights,  which  in  turn  store  learned  (or  pre-programmed)  information,  and  hence  form  an 
important  constituent  of  the  neural  paradigm.  Finally,  nonlinear  thresholding  operations 
are  performed  in  order  to  provide  the  appropriate  transfer  function  between  the  neuron 
activation  potential  (sum  of  all  inputs,  both  positive  and  negative,  to  a  given  neuron  unit), 
and the output each neuron unit generates in response. It is this nonlinearity in particular
that gives multilayer neural networks their computational power, and allows recurrent
networks to iteratively approach one of several stable states of the system. The detailed
nature of the nonlinearity itself can affect a number of critical system properties, including
the number of iterations (or equivalently the time) required to achieve steady state,
and the stability of the network in the presence of noise, inaccuracy, and nonuniformities
among both the neuron units and the interconnections. In some envisioned neural network
implementations, it is important to also be able to alter the nature of the nonlinear
thresholding  function  dynamically  during  the  computational  phase  of  operation. 

In  addition  to  these  elementary  operations  (and  combinations  thereof),  it  is  necessary  to 
accommodate  operationally  for  both  the  learning  function  (which  includes  input-dependent 
interconnection weight updates) and for the computational function (which in general requires
iterative feedback and the implementation of nonlinear thresholding, usually with
fixed interconnection weights). The only additional fundamental operation required by
these features, that of input-dependent interconnection weight updates, can usually be
reduced to at most a combination or sequence of the previously described fundamental
operations of addition, subtraction, multiplication, and nonlinear thresholding. It is important
to note, however, that in this case the operations pertain directly to the implementation
of interconnection weights, rather than to functions performed by the neuron units;
hence, the physical processes involved may in fact be considerably different in nature, and
thus subject to a quite different set of constraints. This differentiation between the two
different “types” of basic operations (those pertaining to the neuron units and those pertaining
to the interconnection pathways and weights) is discussed in more detail below, as
well  as  in  the  section  describing  the  fundamental  physical  and  technological  limitations  of 
neuro-optical  computing. 

At the outset, we must further differentiate between those fundamental photonic operations
that are intended for “optical” implementation, and those that are envisioned
for “optoelectronic” implementation. In the first category, we place those types of physical
processes in which one or more beams of light interact either directly (as in coherent
interference) or through an intermediate physical medium (as in the summation of two incoherent
beams of light on a single detector). In the second category, we place operations
that involve one or more photon-to-electron or electron-to-photon conversion processes
prior to the actual implementation of the desired function, which is then assumed to be
accomplished using intermediate (primarily analog, but perhaps digital) circuitry. An example
of this latter category might be the subtraction of two optical signals by independent
but  simultaneous  photodetection  of  each  signal,  followed  by  the  use  of  an  analog  electronic 
differential  amplifier  to  execute  the  functional  subtraction.  Most  proposed  neuro-optical 
processors rely to some degree on both types of physical implementation mechanisms (cf.
the sections “Architectural Considerations for Photonic Neural Network Implementations”
and “An Implementation Strategy”, below). In fact, the eventual degree of success
achievable  by  neuro-optical  computing  techniques  rests  heavily  on  an  appropriate  balance 
of  these  mechanisms  within  a  given  system,  optimized  to  yield  the  greatest  computational 
advantage  within  the  allowable  physical  and  system  constraints. 

Optoelectronic  implementations  of  neural  functions  for  the  most  part  rely  on  optical 
signals  as  inputs  and  outputs  to  and  from  the  neuron  units  in  order  to  allow  for  both 
high  bandwidth  and  high  interconnection  multiplexing  capacity,  and  on  electronic  signal 
combination and processing locally within each neuron unit. As such, optoelectronic functions
are characterized primarily by the optical characteristics of interconnection, and by
the electronic characteristics of computation. The former will be described below, while
the latter has been discussed both by Bang Lee and Bing Sheu elsewhere in this volume
[Lee, 1991] as well as by Carver Mead in a recent elegant monograph [Mead, 1989]. On
the other hand, optical implementations of these functions rely on communication by, as
well  as  the  interaction  of,  two  or  more  optical  signals  in  order  to  accomplish  the  relevant 
computation,  which  is  often  (but  not  always)  followed  by  a  photon-to-electron  conversion 
process  in  some  form  of  single  channel  or  array  detector.  It  is  to  the  fundamental  principles 
of  such  optical  computational  interactions  that  we  next  turn  our  attention. 

In  order  to  adequately  consider  even  so  basic  a  process  as  optical  addition,  it  is  essential 
to  differentiate  between  two  basic  types  of  optical  interactions  (as  determined  by  the  nature 
of  the  optical  signals  involved):  incoherent  and  coherent.  Incoherent  interactions  occur 
whenever  the  light  wavefronts  representing  the  input  signals  temporally  dephase  (do  not 
oscillate  in  unison)  over  the  relevant  time  of  observation  (detector  temporal  integration 
window),  in  that  they  are  either  both  temporally  incoherent  at  the  outset,  or  are  each 
temporally coherent but separated in optical frequency by more than the inverse of the
observation time. Interactions in which the input optical signals spatially dephase over
the  aperture  of  the  relevant  detector  wherever  the  output  is  utilized  (detector  spatial 
integration  window)  are  also  incoherent  for  all  practical  purposes,  and  will  obey  incoherent 
summation  rules  as  given  below.  Coherent  interactions  occur,  on  the  other  hand,  whenever 
the  light  wavefronts  representing  the  input  signals  simultaneously  maintain  a  constant 
phase  relationship  over  the  detector  spatial  and  temporal  integration  windows. 

From these remarks, it can be seen that it is quite important to understand the distinction
between coherent (or incoherent) light and coherent (or incoherent) interactions as
defined  by  the  eventual  detector  configuration  and  operational  parameters.  For  example, 
it  is  perfectly  acceptable  to  consider  a  situation  in  which  two  mutually  coherent  optical 
beams  interact  to  produce  an  interference  pattern  with  a  spatial  scale  small  compared  with 
the  relevant  detector  aperture.  In  such  cases,  the  interaction  will  in  fact  follow  incoherent 
summation rules, as the detector effectively integrates the space-variant interference pattern
over the full detector aperture to produce exactly the same result as the interaction
of  two  mutually  incoherent  (temporally)  optical  beams. 
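
The averaging effect described in this example can be illustrated numerically. The Python sketch below assumes a simple one-dimensional sinusoidal fringe pattern produced by two mutually coherent beams and averages it over a detector aperture; the function name, the fringe model, and the parameter values are assumptions made for this illustration and are not taken from the text.

```python
import math

def detected_intensity(i1, i2, aperture, fringe_period, phase=0.0, samples=100_000):
    """Average a two-beam fringe pattern over the detector aperture.

    The local intensity is modeled (as an assumption) by
    I(x) = I1 + I2 + 2*sqrt(I1*I2)*cos(2*pi*x/period + phase),
    and is averaged with a midpoint rule over 0 <= x <= aperture.
    """
    total = 0.0
    for k in range(samples):
        x = (k + 0.5) * aperture / samples
        fringe = math.cos(2.0 * math.pi * x / fringe_period + phase)
        total += i1 + i2 + 2.0 * math.sqrt(i1 * i2) * fringe
    return total / samples

# Aperture spans about 1000 fringes: the cross term averages out,
# leaving very nearly the incoherent sum I1 + I2 = 1.5.
fine_fringes = detected_intensity(1.0, 0.5, aperture=1000.3, fringe_period=1.0)

# Aperture is a small fraction of one fringe: the cross term survives.
coarse_fringes = detected_intensity(1.0, 0.5, aperture=0.01, fringe_period=1.0)

print(fine_fringes, coarse_fringes)
```

When many fringes fall within the aperture the detector reports essentially the incoherent sum, even though the beams themselves are mutually coherent.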

Given these preconditions, then, the actual rules for the basic operations are quite straightforward.
Consider first the case of addition of two incoherent optical signal beams
in a collinear geometry, in which the two distinct input beams with intensities $I_1$ and $I_2$
are assumed to emerge from a beam-combining optical system (as yet undetermined) such
that the two output beams are collinear. If the beams are combined, for example, by
a nondispersive (wavelength-insensitive) 50/50 (50% transmission, 50% reflection) beamsplitter
as shown in Figure 15.1(a), two possible output beams $I_{\mathrm{out}}$ and $I'_{\mathrm{out}}$ are created,
each with an intensity given by:

$$I_{\mathrm{out}} = I'_{\mathrm{out}} = \tfrac{1}{2}\,(I_1 + I_2). \tag{1}$$

The  output  intensity  is  thus  linearly  proportional  to  the  sum  of  the  input  intensities.  Note 
that  this  operation  cannot  be  accomplished  without  an  inherent  loss,  in  the  case  shown 
above equal to 0.5 or about 3 dB. In fact, if we wish to combine N beams collinearly by
this technique (using a linear chain of non-dispersive beamsplitters), $N - 1$ beamsplitters
are required with transmissivities given by 1/2, 2/3, 3/4, ..., $(N - 1)/N$, representing a
total loss of $(N - 1)/N$ with an overall throughput of $1/N$:

$$I_{\mathrm{out}} = \frac{1}{N}\,(I_1 + I_2 + I_3 + \cdots + I_N). \tag{2}$$

Instead  of  using  a  linear  chain  of  beamsplitters  with  different  transmissivities,  we  could 
alternatively  construct  a  binary  tree  structure  by  pairing  the  inputs  that  again  requires 
$N - 1$ beamsplitters, but in this case with equal transmissivities of 1/2. This system of
beamsplitters also exhibits an overall throughput of $1/N$ (for even values of N). Although
beam combination of a large number of inputs by the multiple beamsplitter method is
impractical, we will utilize this result a little later in order to understand the essential features
of holographic beam combiners, which are subject to many of the same constraints.
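
Both beamsplitter arrangements can be checked with a short Python sketch that tracks intensities only, as is appropriate for incoherent beams. The function names are hypothetical. The chain assumes that the accumulated beam is transmitted and each new beam is reflected at every beamsplitter, and the tree version is restricted here, for simplicity, to input counts that are powers of two.

```python
def chain_combine(intensities):
    """Linear chain of N - 1 beamsplitters with transmissivities 1/2, 2/3, 3/4, ...

    At beamsplitter k the accumulated beam is transmitted with transmissivity
    k/(k+1) and the new beam is reflected with reflectivity 1/(k+1).
    """
    out = intensities[0]
    for k, new_beam in enumerate(intensities[1:], start=1):
        transmissivity = k / (k + 1)
        out = transmissivity * out + (1.0 - transmissivity) * new_beam
    return out

def tree_combine(intensities):
    """Binary tree of 50/50 beamsplitters (input count assumed a power of two)."""
    level = list(intensities)
    if len(level) == 0 or len(level) & (len(level) - 1):
        raise ValueError("this sketch assumes a power-of-two number of inputs")
    while len(level) > 1:
        level = [0.5 * (level[i] + level[i + 1]) for i in range(0, len(level), 2)]
    return level[0]

beams = [1.0, 2.0, 3.0, 4.0]
print(chain_combine(beams))   # close to (1 + 2 + 3 + 4) / 4 = 2.5
print(tree_combine(beams))    # 2.5
```

In both cases every input reaches the output with weight 1/N, so the result is the sum of the inputs at an overall throughput of 1/N, as in Equation (2).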


FIGURE  15.1  Illustration  of  optical  addition  utilizing  a  50/50  beamsplitter: 
(a)  collinear  incoherent  beam  geometry;  (b)  collinear  coherent  beam  geometry, 
showing input and output amplitudes; (c) collinear coherent beam geometry,
showing  input  and  output  intensities. 


It  should  be  noted  in  passing  that  the  overall  throughput  loss  implied  by  Equation  (2) 
can  be  circumvented  if  the  beams  to  be  summed  incoherently  are  sufficiently  distinct  in 
wavelength  that  a  dichroic  mirror  can  be  used  to  combine  them.  A  dichroic  mirror  reflects 
light within a given wavelength range, and transmits light outside of that range. Multiple
dichroic mirrors can be used to collinearly sum multiple beams through appropriate
choice of the input wavelengths in each arm, and of the characteristic wavelengths of each
succeeding  dichroic  mirror. 

Consider  next  the  case  of  addition  of  two  coherent  optical  signal  beams  in  a  collinear 
geometry.  An  example,  again  using  a  beamsplitter,  is  shown  schematically  in  Figure 
15.1(b). The presence of the beamsplitter generates a $\pi/2$ phase shift for each transmitted
beam, and a $\pi$ phase shift for each reflected beam [Haus, 1984]. In this case, due to the
coherent nature of the two input signal beams, the output intensity is no longer given
simply by the sum of the input intensities. In fact, the output amplitude is proportional
to the sum of the input signal amplitudes (provided that the two beams have identical
polarizations):

$$a_{\mathrm{out}} = \frac{1}{\sqrt{2}}\left(i a_1 - a_2 e^{i\phi}\right) \tag{3}$$

in which $\phi$ is the relative phase between the two wavefronts (here assumed constant over the
detector  aperture).  It  should  be  noted  parenthetically  that  optical  beams  with  orthogonal 
polarizations  do  not  interfere,  and  hence  follow  the  incoherent  addition  rule.  We  assume 
throughout  this  chapter  that  all  beams  are  polarized  identically. 

From  this  simple  equation,  several  important  principles  can  be  seen  to  emerge.  First, 
the representation we must choose for simple addition to occur with coherent light is different
than in the case of incoherent light: in the coherent case, we must use the amplitudes
(containing phase information), whereas in the previous (incoherent) case addition is linear
in the intensities. Second, the input-output transformation represented by Equation
(3) reveals an easy method for implementing both addition and subtraction: we merely
set the phase difference to $-\pi/2$ for addition, and to $\pi/2$ for subtraction. This can be
accomplished  either  by  adjusting  the  relative  path  lengths  of  the  two  input  beams,  or  by 
inserting  an  appropriately  oriented  wave  plate  in  one  of  the  two  beams.  In  the  incoherent 
case  treated  above,  no  such  algorithm  exists  since  we  are  adding  intensities  (which  are 
positive  definite  quantities),  and  direct  subtraction  is  not  possible  without  intermediate 
intervention  by  an  active  optical  or  optoelectronic  device.  Third,  note  that  the  second 
output  beam  is  now  not  symmetric  with  the  first: 


$$a'_{\mathrm{out}} = \frac{1}{\sqrt{2}}\left(-a_1 + i a_2 e^{i\phi}\right). \tag{4}$$

This asymmetry in output amplitudes directly results from the asymmetry between the
phases of the reflected and transmitted components in a partially transmitting mirror
[Haus, 1984].

Since the intensity of an optical beam is related to its amplitude by the relation:

$$I_m = \left(\mathbf{a}_m^{*}\right)^{T} \cdot \mathbf{a}_m \tag{5}$$

in which $\mathbf{a}_m$ is the vector amplitude of the wave representing its polarization, * represents
the  complex  conjugation  operation,  and  T  represents  the  transpose  of  the  vector,  the 
output  intensities  in  the  two  coherently  summed  channels  are  given  by: 


$$I_{\mathrm{out}} = \tfrac{1}{2}\left[a_1^2 + a_2^2 - 2 a_1 a_2 \sin\phi\right], \tag{6}$$

and

$$I'_{\mathrm{out}} = \tfrac{1}{2}\left[a_1^2 + a_2^2 + 2 a_1 a_2 \sin\phi\right], \tag{7}$$

as shown in Figure 15.1(c). From these two equations, it can be seen that for arbitrary
values of the phase shift $\phi$, the output intensity is not simply related to the sum of the input
intensities, but instead has a seemingly undesirable cross term. We can use this cross term
to advantage by noting that for phase shifts of 0 or any integer multiple of $\pi$, both output
intensities  are  equal,  and  reduce  to  the  expression  previously  noted  for  the  incoherent  case 
(Equation  (1)).  Thus  for  coherent  illumination,  we  can  either  perform  addition  directly 
with  the  amplitudes,  or  with  the  intensities  if  we  are  careful  about  proper  phasing  of  the 
input  signals.  One  difficulty  with  the  former  approach  is  that  most  detectors  are  linear  in 
intensity  but  not  in  amplitude,  as  will  be  discussed  in  further  detail  below. 
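
The behavior of the two output ports can be explored with complex arithmetic. The Python sketch below applies the amplitude relations of Equations (3) and (4) directly, with real input magnitudes a1 and a2 and relative phase phi; the function name and the sample values are assumptions made for this illustration.

```python
import cmath
import math

def beamsplitter_5050(a1, a2, phi):
    """Output amplitudes of a 50/50 beamsplitter for two coherent inputs.

    A transmitted beam acquires a factor i (pi/2 phase shift) and a reflected
    beam a factor -1 (pi phase shift), following Equations (3) and (4).
    """
    b2 = a2 * cmath.exp(1j * phi)            # second input with its relative phase
    a_out = (1j * a1 - b2) / math.sqrt(2)
    a_out_prime = (-a1 + 1j * b2) / math.sqrt(2)
    return a_out, a_out_prime

def intensity(amplitude):
    """Intensity of a scalar (single-polarization) complex amplitude."""
    return abs(amplitude) ** 2

a1, a2 = 3.0, 1.0

# phi = -pi/2: the first port carries the amplitude sum, |a_out|^2 = (a1 + a2)^2 / 2.
add_out, add_out_prime = beamsplitter_5050(a1, a2, -math.pi / 2)

# phi = +pi/2: the first port carries the amplitude difference.
sub_out, _ = beamsplitter_5050(a1, a2, math.pi / 2)

# phi = 0: both ports carry (I1 + I2) / 2, as in the incoherent case.
eq_out, eq_out_prime = beamsplitter_5050(a1, a2, 0.0)

print(intensity(add_out), intensity(sub_out), intensity(eq_out), intensity(eq_out_prime))
```

For any value of phi the two output intensities add up to the total input intensity, so the cross term only redistributes power between the two ports.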

In  direct  analogy  with  the  analysis  presented  for  the  case  of  incoherent  multiple  signal 
beam  summation,  we  can  extend  the  above  equations  to  include  the  case  of  multiple 
collinear  coherent  inputs  using  appropriate  combinations  of  beamsplitters.  For  N  coherent 
input beams summed optimally, the output amplitude is given by:

$$a_{\mathrm{out}} = \frac{1}{\sqrt{N}}\left[a_1 + a_2 + a_3 + \cdots + a_N\right]. \tag{8}$$

In order to achieve this result, one must again use $N - 1$ beamsplitters with (intensity)
transmissivities identical to those employed in the incoherent case, and in addition the
relative phases must be arranged such that the phase difference between $a_2$ and $a_1$ is
$-\pi/2$, and each successive beam is increased in phase by $\pi/2$.
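
The optimal N-beam result can be illustrated with a short Python sketch of a beamsplitter cascade. The layout is hypothetical: at stage k the running sum of k beams is assumed to be transmitted with intensity transmissivity k/(k+1), the new beam is reflected with a pi/2 phase shift, and each new beam is pre-shifted by -pi/2 to compensate. The phase schedule quoted in the text corresponds to a particular geometry; this sketch uses its own simplified convention and is not intended to reproduce that geometry.

```python
import cmath
import math

def cascade_sum(amplitudes):
    """Combine N phase-compensated beams with N - 1 beamsplitters.

    Assumptions (hypothetical layout): at stage k the running sum of k beams is
    transmitted with intensity transmissivity k / (k + 1), the new beam is
    reflected with intensity reflectivity 1 / (k + 1), and reflection adds a
    pi/2 phase shift. Each new beam is pre-shifted by -pi/2 to compensate.
    """
    total = complex(amplitudes[0])
    for k, a in enumerate(amplitudes[1:], start=1):
        t = math.sqrt(k / (k + 1))            # transmits the running sum
        r = 1j * math.sqrt(1 / (k + 1))       # reflects the new beam
        pre_shifted = a * cmath.exp(-1j * math.pi / 2)
        total = t * total + r * pre_shifted
    return total
```

With these assumptions, four beams of amplitudes 1, 2, 3, and 4 yield an output amplitude of 10 divided by the square root of 4, and N equal beams are combined without loss of total intensity.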

In  all  of  the  cases  described  above,  we  have  constrained  the  problem  by  requiring  that 
the  output  beams  all  be  collinear,  and  in  fact  many  proposed  neuro-optical  architectures 
implicitly  demand  such  a  constraint.  We  will  show  in  later  sections,  however,  that  this 
is  perhaps  an  unnecessary  and  in  many  cases  undesirable  constraint.  Hence  we  consider 
here also the case of optical addition with noncollinear output beams, requiring instead
only  that  the  summed  beams  fill  the  same  detector  aperture.  There  are  in  fact  a  number 
of  interesting  variants  of  these  two  constraints,  but  we  will  limit  the  discussion  to  the  two 
principal  cases  only.  One  possible  such  configuration  is  shown  in  Figure  15.2(a),  in  which 
two incoherent signal beams are summed within the detector aperture by using two 100%
reflecting  mirrors,  producing  an  output  intensity  given  by: 

$$I_{\mathrm{out}} = I_1 + I_2. \tag{9}$$

This  output  intensity  is  uniform  across  the  detector  aperture,  as  shown  schematically  in 
the figure. In addition, relaxation of the requirement for collinearity can be seen to now
allow  for  the  use  of  mirrors  instead  of  beamsplitters,  eliminating  the  loss  we  found  in  the 
previous  case  at  the  expense  of  increased  angular  multiplexing. 


FIGURE 15.2 Illustration of optical addition utilizing mirrors: (a) angularly
multiplexed incoherent beam geometry; (b) angularly multiplexed coherent
beam geometry.


Up  to  this  point  in  the  discussion,  we  have  had  to  consider  the  phase  of  the  optical 
wavefronts  only  for  the  case  of  collinear,  coherent  addition.  In  that  case,  we  only  needed 
to  use  the  relative  phase  shift  between  the  two  beams  to  derive  Equations  (6)  and  (7), 
since the phase shift is constant in both space and time over the detector spatial and
temporal integration windows. In order to consider the case of noncollinear, coherent
addition, however, we must allow for the space-variant phase shifts that naturally result
when two coherent wavefronts cross at a non-zero angle. These effects are automatically
taken into account if we express each wave (beam) amplitude in a form that incorporates
both its magnitude and its phase everywhere in space at a given instant of time. For a
plane wave (in which the planes of constant phase are oriented normal to the direction
of propagation), it proves convenient to use the form $\mathbf{a}\exp(i\mathbf{k}\cdot\mathbf{r})$, in which $\mathbf{a}$ is a vector
representing the polarization of the optical wave (its amplitude in each of the principal
coordinate directions), $\mathbf{k}$ is the wave vector of the optical wave (defined as a vector with
direction normal to the planes of constant phase, and with magnitude $|\mathbf{k}| = 2\pi n/\lambda$, in
which $n$ is the refractive index of the propagation medium and $\lambda$ is the wavelength of the
light wave in vacuum), and $\mathbf{r}$ is a position vector defined from an arbitrary origin in space
($\mathbf{r} = x\hat{\mathbf{x}} + y\hat{\mathbf{y}} + z\hat{\mathbf{z}}$, in which $\hat{\mathbf{x}}$, $\hat{\mathbf{y}}$, and $\hat{\mathbf{z}}$ are unit vectors along the Cartesian coordinate
axes).

For  the  case  of  coherent  illumination,  then,  the  result  of  noncollinear  summation  is  as 
shown  schematically  in  Figure  15.2(b).  In  this  case,  the  phase  of  each  input  wave  varies 
across  the  detector  aperture  (assumed  to  lie  in  the  plane  z  =  0)  at  a  rate  that  is  a  function 
of  the  angular  separation  of  the  incident  beams,  as  well  as  of  the  angular  deviation  of  the 
bisector  from  normal  incidence.  This  condition  can  be  represented  by  writing  the  wave 
amplitudes  with  a  space-variant  phase,  which  is  in  turn  dependent  on  the  x-component 
of the wave vector $k_x$ in the form $a_1\exp(ik_x x)$. The two waves will thus interfere in the
plane  of  the  detector,  forming  essentially  a  new  wave  with  a  local  amplitude  given  by  the 
sum  of  the  incident  amplitudes.  The  resulting  intensity  pattern  has  both  a  space-invariant 
(uniform)  and  a  space-variant  (sinusoidal)  component: 

$$I_{\mathrm{out}} = a_1^2 + a_2^2 + 2a_1a_2\cos 2k_x x. \tag{10}$$

If  we  assume  that  the  detector  is  linear  in  intensity  over  the  dynamic  range  represented 
by  this  equation,  and  furthermore  that  the  detector  is  uniform  in  responsivity  over  its 
aperture,  then  the  output  from  the  detector  will  be  the  spatial  average  of  this  interference 
pattern, resulting in an output intensity that is in fact a sum of the input intensities, as
represented  by  Equation  (9).  In  this  case,  the  result  is  independent  of  the  relative  phase 
shift  (difference)  between  the  two  beams  at  their  points  of  entry  into  the  beam  combiner 
system,  since  such  a  phase  shift  will  merely  result  in  a  translation  of  the  interference 
pattern  without  altering  its  integrated  value.  This  result  also  can  be  extended  to  the  case 
of  multiple  input  signal  beams,  with  the  stipulation  that  mirrors  cannot  be  allowed  to 
occlude  each  other;  hence,  for  a  given  beam  width,  only  a  certain  number  of  beams  can 
be  combined  without  loss  by  means  of  this  method  without  overcrowding  the  available 
angular  spectrum. 
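
The phase independence of the spatially averaged output can be checked numerically. The Python sketch below averages an interference pattern of the form a1^2 + a2^2 + 2 a1 a2 cos(2 kx x + delta) across a uniform detector. The function name, the aperture parameterization, the added phase offset delta, and the midpoint-rule integration are assumptions made for illustration.

```python
import math

def detected_intensity(a1, a2, kx, width, delta=0.0, samples=20000):
    """Spatially average a1^2 + a2^2 + 2 a1 a2 cos(2 kx x + delta) over a detector.

    Assumptions: uniform responsivity, aperture spanning -width/2 .. +width/2,
    midpoint-rule integration; delta is the relative phase at the combiner input.
    """
    dx = width / samples
    total = 0.0
    for i in range(samples):
        x = -width / 2 + (i + 0.5) * dx
        total += a1**2 + a2**2 + 2 * a1 * a2 * math.cos(2 * kx * x + delta)
    return total * dx / width
```

The average equals a1^2 + a2^2 for any delta when the aperture spans a whole number of fringe periods (each of length pi/kx). For an aperture that covers only a fraction of a fringe, the cross term does not average out, so the sum-of-intensities result implicitly assumes a detector that is wide compared with the fringe spacing.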

The  operation  of  optical  multiplication  is  fundamentally  different  in  a  number  of  ways 
from  those  of  addition  and  subtraction.  Perhaps  the  most  important  difference  is  that  the 
multiplication  of  either  beam  amplitudes  or  intensities  cannot  be  accomplished  directly, 
but  must  instead  utilize  a  nonlinear  medium  of  some  form  within  which  the  beams  can 
interact.  There  are  two  principal  types  of  interactions  to  consider:  those  in  which  the  two 
beams  must  be  present  simultaneously  in  order  to  form  the  desired  product,  and  those 
in which the two beams are utilized sequentially in time. In general, the former interactions
tend to operate on the amplitudes and hence require mutual coherency, whereas the
latter  interactions  typically  form  products  of  (incoherent  or  coherent)  intensities,  which 
are  therefore  more  straightforward  to  detect  with  currently  available  intensity-sensitive 
detectors. 

Simultaneous  multiplication  of  two  optical  beams  is  suggested  by  Figure  15.2(b),  in 
which  two  coherent  signal  beams  are  angularly  multiplexed  to  form  the  interference  pattern 
given by Equation (10). Note that the space-variant part of the output intensity in the
plane of the “detector” is proportional to the product of the amplitudes, i.e., is of the
form $2a_1a_2\cos 2k_x x$. If instead of employing a uniform (spatially averaging) detector as
before, we were now to employ a space-variant detector sensitive to the local intensity,
it would be possible to record this modulation term along with the unmodulated (uniform) bias
represented by the squares of the two amplitudes. If in addition the “detector”, for example,
is  assumed  to  generate  a  change  in  either  its  absorption  coefficient  or  its  refractive  index 
as  a  function  of  the  recorded  intensity  pattern  (for  a  given  exposure),  a  diffraction  grating 
will be formed. The resulting diffraction grating can be characterized by an amplitude
that is proportional to the product of the input signal beam amplitudes, and that can
be probed by a third so-called “readout” beam. This is at once the basic principle of
holographic  recording  (as  explained  in  more  detail  below  in  the  subsection  on  “Photonic 
Interconnections”),  and  at  the  same  time  allows  the  implementation  of  the  multiplicative 
operation  for  coherent  inputs.  It  should  be  noted  that  although  this  process  produces  a 
useful  result  for  the  case  of  two  inputs,  extension  to  larger  numbers  of  inputs  is  not  trivial, 
and  requires  the  utilization  of  higher  order  terms  in  the  susceptibility  tensor  (representing 
the  complex  dielectric  constant)  for  implementation.  The  one  exception  to  this  rule  is 
the use of the probe (readout) beam intensity $I_p$ as a third effective input, in which case
output intensities proportional to either $I_p a_1 a_2$ or $I_p I_1 I_2$ can be detected depending on the
operational  parameters  of  the  recording  medium  and  readout  configuration. 

If  the  simultaneity  requirement  is  relaxed  to  allow  for  sequential  interactions  in  an 
intervening  photosensitive  medium,  then  it  is  possible  to  multiply  two  incoherent  input 
signals  by  means  of  the  simple  generic  scheme  shown  in  Figure  15.3.  The  medium  again 
acts as an effective detector for beam 1, generating a transmittance (in its range of linearity)
proportional to the intensity of beam 1. This transmittance can be generated either
directly,  or  through  the  exposure  given  by  the  product  of  the  intensity  and  the  exposure 
time  as  in  the  familiar  case  of  photographic  film.  Beam  2  is  effectively  employed  as  a  probe 
beam,  such  that  the  output  intensity  is  given  by: 

$$I_{\mathrm{out}} = cI_1I_2, \tag{11}$$

as  desired,  in  which  c  is  a  proportionality  constant  subject  to  the  constraint  that: 


$$cI_1 < 1. \tag{12}$$

This  process,  as  in  the  previous  case,  is  extendable  to  accommodate  an  arbitrary  number 
of  inputs  by  iteration,  unfortunately  resulting  both  in  a  lengthy  generation  sequence  for  a 
large  number  of  inputs,  and  in  the  potential  for  significant  nonlinear  effects  with  a  heavily 
constrained overall dynamic range. For the case of $N$ input beams, we can utilize $N - 1$
exposure steps in combination with $N - 1$ intermediate readout steps and a final readout
step with beam $N$ to generate an output intensity of the form:

$$I_{\mathrm{out}} = c^{N-1}I_1I_2I_3\cdots I_{N-1}I_N, \quad \text{with} \quad c^{N-1}I_1I_2I_3\cdots I_{N-1} < 1. \tag{13}$$

In  direct  analogy  to  the  case  of  summation,  we  could  instead  utilize  a  binary  tree  structure, 
which requires only $\log_2 N$ time steps but uses the same number of devices.
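
The iterated multiplication scheme and its dynamic range constraint can be sketched in a few lines of Python. The function names are hypothetical, and the sketch assumes that every stage shares the same proportionality constant c and that the transmittance written at each stage must remain below unity. The step count for the binary tree is rounded up to the next integer when N is not a power of two, which is an added assumption.

```python
import math

def sequential_product(intensities, c):
    """Multiply N intensities by repeated exposure/readout, giving c^(N-1) * I_1 ... I_N.

    Assumptions: each stage has the same hypothetical proportionality constant c,
    and the stage transmittance (c times the running intensity) must stay below 1.
    """
    running = intensities[0]
    for nxt in intensities[1:]:
        transmittance = c * running
        if not transmittance < 1:
            raise ValueError("transmittance c*I must stay below 1 (linear range)")
        running = transmittance * nxt
    return running

def time_steps(n):
    """Return (sequential steps, binary-tree steps) for n inputs."""
    return n - 1, math.ceil(math.log2(n))
```

For eight inputs, for example, the sequential chain needs seven multiplication steps while the binary tree needs three. The check inside the loop shows how the usable dynamic range shrinks as stages are chained, since each intermediate product must itself remain within the linear range of the medium.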

Finally,  it  should  be  noted  that  this  latter  process  of  incoherent  beam  multiplication 
through  an  intervening  medium  by  sequential  illumination  is  suggestive  of  the  process 
of spatial light modulation, in which the same basic concept is extended to cover a
two-dimensional array of multiplication elements. In fact, this process is an essential component
of  the  general  area  of  photonic  switching,  to  which  we  will  turn  our  attention  below. 


FIGURE  15.3  Illustration  of  optical  multiplication  utilizing  a  medium  with 
variable  transparency. 


Before  turning  to  the  topics  of  photonic  switching  and  photonic  interconnections,  we 
conclude  this  section  with  a  discussion  of  the  fourth  principal  computational  process  that 
can  be  performed  optically  (as  opposed  to  optoelectronically,  as  discussed  below):  that 
of the incorporation of functional nonlinearity. Although many types of functional
nonlinearities are of interest in a generalized analog computational system, those of primary
utility in the neural network environment are for the most part threshold-like in nature. A
threshold function $f_T(x)$ of some input variable $x$ (such as the input intensity, for example)
can  be  described  in  general  by: 


$$f_T(x) = \begin{cases} T_{\min}, & -\infty < x < x_1 \\ m(x), & x_1 < x < x_2 \\ T_{\max}, & x_2 < x < \infty \end{cases} \tag{14}$$


in which the function $m(x)$ is a monotonic function with a minimum value of $T_{\min}$ and a
maximum value of $T_{\max}$. For a step function response, the function $m(x)$ can be eliminated
by setting $x_1 = x_2$. In many cases, a smoother transition between the two extreme states
has  been  found  to  generate  enhanced  network  stability  and  faster  settling  times.  In  such 
cases,  the  function  m(x)  may  be  taken,  for  example,  as  a  sigmoid  with  an  exponential 
onset  and  an  asymptotic  approach  to  the  saturation  level. 
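
A minimal Python sketch of such a threshold function follows. The transition function m(x) is taken here as a raised-cosine ramp, which is an assumption chosen only because it is monotonic and meets the lower and upper saturation values at the two breakpoints; it is not the exponential-onset sigmoid mentioned above. The function and parameter names are likewise hypothetical.

```python
import math

def threshold(x, x1, x2, t_min=0.0, t_max=1.0):
    """Piecewise threshold function: t_min below x1, t_max above x2, m(x) between.

    Assumption: m(x) is a raised-cosine ramp, chosen only because it is monotonic
    and meets t_min at x1 and t_max at x2; the text leaves m(x) unspecified
    beyond monotonicity. Setting x1 == x2 gives a step response.
    """
    if x < x1:
        return t_min
    if x > x2:
        return t_max
    if x1 == x2:                       # step function: no transition region
        return t_max
    u = (x - x1) / (x2 - x1)           # normalised position in the transition
    return t_min + (t_max - t_min) * 0.5 * (1 - math.cos(math.pi * u))
```

Narrowing the interval between x1 and x2 sharpens the response toward a step, while widening it gives the smoother transition that the text associates with improved network stability.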

The  incorporation  of  such  nonlinear  functionality  by  direct  optical  means  can  be  achieved 
through the use of a number of different types of nonlinear materials; such materials typically
exhibit a change in their refractive index or absorption coefficient proportional to
the first and higher order powers of the local optical intensity. One example of such a
material is photographic film, which after development exhibits a (negative) sigmoid-like
exposure characteristic, with a saturation value determined by the maximum optical density
achievable within a given film thickness. (The optical density (OD) of a medium is
given by the negative of the decadic logarithm of its transmittance; for example, a film that
transmits 1% of the incident illumination has an optical density of two (OD2).) Another
common  example  of  an  optical  nonlinearity  is  the  photoconductive  saturation  behavior  of 
certain  semiconductor  materials  such  as  cadmium  sulfide,  zinc  selenide,  and  silicon.  In 
this  latter  case,  the  distinction  between  an  “all-optical”  nonlinearity  and  an  optoelectronic 
nonlinearity  becomes  somewhat  blurred,  as  the  photoconductor  can  be  thought  of  as  a 
light-sensitive  electronic  device. 
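
The optical density relation quoted above is easily evaluated. The following Python sketch, with hypothetical function names, converts between intensity transmittance and optical density.

```python
import math

def optical_density(transmittance):
    """Optical density: negative decadic logarithm of the intensity transmittance."""
    if not 0 < transmittance <= 1:
        raise ValueError("transmittance must lie in (0, 1]")
    return -math.log10(transmittance)

def transmittance_from_od(od):
    """Inverse relation: fraction of incident light transmitted at a given OD."""
    return 10 ** (-od)
```

A film transmitting 1% of the incident illumination thus has an optical density of two, and each additional unit of optical density reduces the transmitted intensity by a further factor of ten.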

Such optical techniques for the generation of functional nonlinearities at present suffer
several inherent disadvantages, in that they often require either an off-line post-exposure
development step (which is unsuitable for real time operation at high frame rates), long
response times, or very high optical intensities to achieve saturation. In addition, such
materials have not as yet proven to be readily programmable, which is often a desirable
feature from the systems perspective in order to accommodate variable threshold functions,
gain, saturation values, and offsets. As we will see in the next section, the incorporation
of electronic circuitry with optical detectors and modulators to achieve optoelectronic
nonlinearities can in fact greatly increase the threshold sensitivity and operational bandwidth
of nonlinear switching elements, while simultaneously providing flexible programming
capabilities.


Photonic  Switching 


The switching function, that of providing an output that is (perhaps nonlinearly) dependent
on one or more inputs, is a principal distinguishing characteristic of neuron units.
Electronic circuit elements (particularly as configured by very large scale integration
techniques) are quite well suited to the switching task, as long as the number of inputs
(representing the fan-in) and the number of outputs (representing the fan-out) are both kept
relatively small (less than a few hundred or so for the case of analog fan-in and fan-out).
However, for neural network implementations that demand a high degree of connectivity
(with a concomitantly large number of neuron units), the required gate count as well as
the area required for interconnection routing in purely electronic implementations rapidly
gets out of hand.

The  fundamental  aspects  of  the  fan-in  and  fan-out  components  of  the  switching  function 
are  quite  distinct,  and  lead  to  different  types  of  demands  on  the  chosen  implementation 
technology. The fan-in of a number of inputs requires that a particular functional relationship
be established between the generated output, on the one hand, and the set of inputs,
on  the  other.  In  the  case  of  a  neural  network,  the  output  typically  depends  on  both  sums 
and  differences  of  various  combinations  of  the  inputs.  Therefore,  a  given  implementation 
technology  must  properly  generate  the  requisite  logical  or  functional  relationship,  as  well 
as  provide  for  an  appropriate  physical  input  mechanism  (e.g.  the  input  leads  in  the  case 
of  an  electronic  implementation).  For  electronic  circuits,  the  network  area  required  for  the 
provision  of  input  leads  and  functional  circuitry  typically  scales  directly  with  the  number  of 
inputs, which is an unfortunate dependence when the number of inputs is large. Fan-out,
on the other hand, usually implies the broadcast of a single output value to a number of
(input) locations or nodes. In electronics, the output power required to drive the inputs to
a large number of nodes scales directly with the number of these nodes, which again does
not scale favorably (but turns out to be an unavoidable penalty in any case). Significant
($N$-fold) fan-out often involves the incorporation of high power driver circuitry, which may
have to be duplicated $M$ times ($M < N$) in order to avoid unacceptable loading of the
output  stages. 

The  combination  of  both  fan-in  and  fan-out  components  of  the  switching  function 
reveals  a  further  demand  on  the  real  estate  required  for  the  establishment  of  weighted 
interconnections.  In  a  fully  connected  neural  network  with  N  neurons,  for  example,  area 
must be provided for the incorporation (storage and programming) of $N^2$ independent
weights as well as $N^2$ independent signal pathways. Hence, the chip area required in a
VLSI circuit implementation of such a fully connected neural network will scale at least as
the square of the number of neuron units ($O(N^2)$). Network segmentation into a number of
interconnected  chips  can  help  somewhat  to  expand  the  network  size  beyond  the  limitation 
imposed  by  applying  this  constraint  to  a  single  chip.  However,  the  limiting  factor  in  the 
multiple-chip  case  rapidly  becomes  interchip  communication  (I/O),  as  pinouts  from  VLSI 
chips  of  greater  than  two  hundred  or  so  are  not  technologically  feasible  at  present. 
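
The quadratic scaling can be made concrete with a short Python sketch. The area figures used in the usage note are purely hypothetical and serve only to show how the number of neurons that fit on a chip grows with the square root of the available area.

```python
import math

def weight_count(n_neurons):
    """Number of independent weights in a fully connected network of N neurons."""
    return n_neurons ** 2

def max_neurons(chip_area_um2, weight_cell_area_um2):
    """Largest N whose N^2 weight cells fit on the chip.

    Assumption: area is dominated by identical weight cells of a hypothetical
    size; routing, neuron circuitry, and I/O pads are ignored.
    """
    return math.isqrt(chip_area_um2 // weight_cell_area_um2)
```

Under these assumptions, a 1 cm^2 chip (10^8 square micrometers) with a hypothetical 100 square micrometer weight cell holds at most 1000 fully connected neurons, and quadrupling the chip area only doubles that number.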

Photonic implementations of neural networks take advantage of the simple beam-combining
mechanisms outlined above to multiplex inputs and outputs, and as such exhibit
much  higher  capacity  for  fan-in  and  fan-out  than  do  typical  electronic  implementations. 
The  utilization  of  optical  rather  than  electronic  interconnections  for  the  fan-in  and  fan-out 
functions  provides  for  completely  different  scaling  laws  at  large  numbers  of  inputs  and 
outputs  to  a  given  neuron  unit,  as  described  in  more  detail  in  the  following  subsection 
(“Photonic  Interconnections”). 

Even given photonic interconnections with a high degree of fan-in and fan-out capability,
the nonlinear functional (switching) relationship between the output and combinations
of the inputs must still be provided for. For purposes of neural network implementation,
the primary photonic switching component is the spatial light modulator, a device that
alters  either  the  amplitude  or  phase  across  an  expanded  probe  beam  in  response  to  the 
local  intensity  (or  exposure)  across  an  input  (writing  or  recording)  beam. 

The  simplest  example  of  a  spatial  light  modulator,  albeit  one  that  cannot  operate  at 
real  time  frame  rates,  is  photographic  film.  Following  exposure  to  an  information-bearing 
optical  field,  in  which  an  image  of  a  given  scene  is  brought  into  focus  on  the  two-dimensional 
plane of the film, a “latent” (undeveloped) image is formed within the photographic emulsion
on the surface of the film. Chemical development is used to transform the latent
image into a measurable change in the optical transparency (transmissivity) of the film,
which can then be “read out” or probed by secondary illumination to reveal features of
the recorded scene. In this context, slide projection is in fact the equivalent of amplified
readout with a probe beam, in the sense that the reconstruction of the image is accomplished
at a much higher level of intensity (for a longer period of time) than the original
exposure. 

As can be seen from this example, the basic functions performed by a spatial light modulator
are those of detection, functional transformation, and optical modulation, as shown
schematically in Figure 15.4(a). In the case of photographic film, the detection process occurs
at photosensitive centers during exposure, the functional transformation (the transfer
function that relates the output transparency to the input exposure) is incorporated during
development and fixing, and the optical modulation process is accessed during readout.
This division of the spatial light modulation function into three key elements is particularly
useful in the discussion of optoelectronic spatial light modulators, which typically
consist of separately identifiable detectors, control circuitry, and modulation elements, as
shown schematically in Figure 15.4(b-d). This functional division also allows extensive use
to be made of sophisticated electronic circuitry deployed locally within each pixel, both
to generate programmable nonlinear control functions and to compensate to a certain degree
for the nonidealities inherent in the optical detection and optical modulation elements.
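
The three-stage division of the spatial light modulation function can be sketched as a simple per-pixel model in Python. The linear detector, the responsivity parameter, the user-supplied transfer function, and the function names are all assumptions made for illustration rather than a description of any particular device.

```python
def slm_pixel(write_intensity, read_intensity, transfer, responsivity=1.0):
    """Model one pixel as detection -> functional transformation -> modulation.

    Assumptions: a linear detector with hypothetical responsivity, a user-supplied
    transfer function returning a transmittance, and an intensity modulator.
    """
    detected = responsivity * write_intensity          # detection
    transmittance = transfer(detected)                 # functional transformation
    transmittance = min(max(transmittance, 0.0), 1.0)  # passive modulator: 0..1
    return transmittance * read_intensity              # modulation of the readout beam

def slm_array(write_image, read_intensity, transfer):
    """Apply the pixel model independently to every element of a 2-D array."""
    return [[slm_pixel(w, read_intensity, transfer) for w in row] for row in write_image]
```

Passing a hard-threshold transfer function, for instance, turns a weak write image into a binary pattern impressed on a much stronger readout beam, which is the amplified readout behaviour described above for slide projection.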


FIGURE 15.4 Fundamental principles of spatial light modulator function:
(a) block diagram of the principal functions of an optically-addressed spatial
light modulator, including the detection, functional implementation, and modulation
functions; (b) schematic diagram of an $N \times N$ array of spatial light
modulator pixels, in which three pixels are shown in different transmission states;
(c) expanded view of the pixel array, showing an incomplete fill factor
within each pixel; (d) expanded view of a single pixel within the array, illustrating
one possible pixel configuration that incorporates two detector elements $D_1$
and $D_2$, control electronics for impedance matching and functional implementation,
and two modulator elements, shown here in different transmittance states.


Up  to  this  point  in  the  discussion,  we  have  focused  on  optically-addressed  spatial  light 
modulators  (OASLMs)  that  respond  locally  to  the  incident  light  intensity,  as  this  light 
detection  function  is  common  to  most  envisioned  photonic  neural  network  architectures. 
Another  way  of  controlling  the  modulation  within  an  array  on  a  pixel-by-pixel  basis  is  to 
configure  the  spatial  light  modulator  such  that  it  can  accept  a  serial  or  parallel  electronic 
input  signal,  which  can  be  decoded  (or  demultiplexed)  to  drive  each  individual  modulator 
element.  Such  electrically-addressed  spatial  light  modulators  (EASLMs)  can  be  driven,  for 
example,  by  the  output  of  a  television  camera  to  again  combine  the  functions  of  detection, 
functional transformation (which may be accomplished in an external circuit), and modulation.
One advantage of such a combination is the current advanced state of the art in
closed  circuit  television  cameras  (CCTVs),  which  exhibit  exceedingly  high  performance  at 
relatively  low  cost.  One  notable  disadvantage,  however,  is  the  implied  limitation  on  the 
frame  rate  of  the  combined  device,  since  most  high  resolution  TV  cameras  are  designed  to 
operate  at  less  than  one  hundred  frames  per  second. 

Over  the  past  two  decades,  a  wide  range  of  physical  modulation  mechanisms  have  been 
investigated  for  use  in  various  types  of  spatial  light  modulators.  Such  mechanisms  include 
the  modulation  of  the  index  of  refraction  or  birefringence  in  single  crystal  materials  by 
means of an applied electric field (the electrooptic effect), the reorientation of liquid crystal
molecules (producing in turn a change in the index of refraction or birefringence) by
either  an  applied  electric  field  or  by  local  optically-induced  heating,  changes  in  coloration 
produced  by  optical  absorption  (the  photochromic  effect),  modulation  of  the  polarization 
of  reflected  light  by  application  of  local  magnetic  fields  (the  magnetooptic  effect),  surface 
deformations  in  a  thin  film  or  membrane  induced  by  either  applied  electric  fields  or  local 
optically-induced heating, changes in the local refractive index induced by the application
of pressure or by the transmission of an acoustic wave (the acoustooptic effect), and
electric field modulation of the absorption or dispersion properties of semiconductor device
structures. The utilization of these physical modulation mechanisms in various spatial light
modulator configurations has been addressed in a number of review articles [Tanguay, 1985;
Warde, 1987], journal special issues [e.g., Spatial Light Modulators for Optical Information
Processing, 1989], and topical conference proceedings [e.g., Spatial Light Modulators and
Applications, 1990].

The  principal  configurational  and  operational  characteristics  of  spatial  light  modulators 
that are of interest for application to neural networks include optical sensitivity, write
(input)  wavelength,  read  (output)  wavelength,  input-output  transfer  function,  functional 
programmability,  operational  bandwidth,  degree  of  integration,  pixel  size,  total  number  of 
pixels  per  chip,  output  modulation  contrast  ratio,  dynamic  range,  and  dissipated  power 
density.  In  many  cases,  these  characteristics  are  interdependent,  and  thus  impose  at  times 
contradictory  design  constraints  that  must  be  optimized  in  the  overall  systems  context. 
The  fundamental  and  technological  limitations  that  affect  device  design  and  performance 
are  discussed  further  below  and  in  a  succeeding  section. 

As  is  the  case  for  electronic  circuitry,  both  monolithic  and  hybrid  approaches  to  the 
development  of  optoelectronic  spatial  light  modulators  with  suitable  functionality  have 
been employed. In the monolithic approach, the detectors, control circuitry, and modulation
elements within each individual picture element (pixel) are integrated within a
single class of materials on a supporting substrate, as shown schematically in Figure 15.5.
An example of such an approach is the integration of p-n or p-i-n junction photodiodes
with metal-semiconductor field effect transistors (MESFETs) [Sze, 1981a] to drive multiple
quantum well (MQW) optical modulators based on the quantum confined Stark effect
(QCSE) [Miller, 1990], all fabricated by means of photolithographic processing with multiple
mask levels on gallium arsenide (GaAs) substrates. In Figure 15.5, two distinct
approaches to the monolithic integration of spatial light modulators are illustrated, differentiated
primarily by the method employed to physically or electrically isolate (pixelate)
the  modulator  elements. 

Two  particularly  critical  parameters  of  spatial  light  modulators  used  in  neural  network 
implementations are the contrast ratio and dynamic range of the modulator. Their values
can in certain cases be increased by incorporating the active modulation layer (for example,
a multiple quantum well device) within a symmetric or asymmetric optical (Fabry-Perot)
cavity [Whitehead, 1989a; Whitehead, 1989b; Whitehead, 1989c; Van, 1989]. The asymmetric
case is shown schematically in Figure 15.5(a), in which two multilayer Bragg mirrors
are used to form a reflective cavity with a high reflectivity (R) on the substrate side, and a
lower reflectivity on the air-incident side. One of several advantages of monolithic integration
is the potential for utilizing common components for multiple purposes. For example,
the basic MQW modulator structure can also be used as a p-i-n photodetector by application
of appropriate bias voltages, as shown in Figure 15.5(b). To date, significant progress
in such monolithically integrated optical modulators has been achieved, although spatial
light modulators with large numbers ($> 10^4$) of pixels have not yet been fabricated that
exhibit  the  relatively  high  degree  of  integration  described  above. 
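
The contrast enhancement offered by an asymmetric cavity can be illustrated with a short Python sketch. It uses the standard plane-wave expression for the on-resonance reflectivity of a Fabry-Perot cavity containing an absorbing layer, which is a textbook result introduced here as an assumption and not taken from this report. The mirror reflectivities, the absorption values, and the function names are hypothetical.

```python
import math

def cavity_reflectivity(r_front, r_back, alpha_d):
    """On-resonance reflectivity of an absorbing asymmetric Fabry-Perot cavity.

    r_front, r_back: intensity reflectivities of the two mirrors;
    alpha_d: absorption coefficient times active-layer thickness (dimensionless).
    Assumption: standard plane-wave, normal-incidence result; not from the source.
    """
    rf = math.sqrt(r_front)
    rb_eff = math.sqrt(r_back) * math.exp(-alpha_d)   # round-trip amplitude loss
    return ((rf - rb_eff) / (1 - rf * rb_eff)) ** 2

def contrast_ratio(r_front, r_back, alpha_d_low, alpha_d_high):
    """Ratio of the brighter to the darker reflective state."""
    hi = cavity_reflectivity(r_front, r_back, alpha_d_low)
    lo = cavity_reflectivity(r_front, r_back, alpha_d_high)
    return max(hi, lo) / min(hi, lo)
```

In this model the reflectivity vanishes when the absorption is tuned so that the effective back-mirror reflectivity matches that of the front mirror, which is why a modest change in absorption can produce a much larger contrast ratio than the same layer would give in a simple double-pass geometry.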


FIGURE 15.5 Examples of monolithically-integrated spatial light modulators.
The chosen examples incorporate photodetectors, control circuitry, and
multiple quantum well modulators within each pixel on a single gallium arsenide
(GaAs) substrate. In (a), the control electronics and photodetector
elements  are  fabricated  following  the  photolithographic  definition  and  physical 
isolation  of  the  modulator  elements,  while  in  (b)  a  buffer  (isolation)  layer  is 
used  to  allow  fabrication  and  interconnection  of  all  of  the  elements  without 
chemical  or  ion  beam  etching. 


In the hybrid approach, on the other hand, certain of the device functions may be integrated
on a substrate within one materials system (with its associated process technology)
in  order  to  optimize  either  their  performance  characteristics  or  manufacturability,  while 
others  are  integrated  on  a  separate  substrate  within  a  different  materials  system  (with  a 
necessarily  distinct  process  technology).  Following  separate  processing  sequences  for  each 
individual component, the two substrates are then interconnected (bonded together) such
that the mating pixels on each substrate are in pairwise electrical contact. For example,
several currently investigated types of spatial light modulators (SLMs) incorporate the detection
elements and control circuitry on a silicon (Si) substrate utilizing standard VLSI
design  rules,  while  the  modulation  elements  are  based  in  a  separate  technology  (such  as 
multiple  quantum  well  structures  integrated  on  a  GaAs  substrate).  Alternatively,  hybrid 
spatial  light  modulators  can  be  fabricated  on  a  single  common  substrate,  with  additional 
functionality  provided  by  the  growth,  deposition,  or  coating  of  a  second  active  material 
onto  the  substrate.  Examples  of  this  type  of  hybrid  SLM  include  silicon  VLSI/ferroelectric 
liquid  crystal  devices  [Drabik,  1990]  and  silicon/PLZT  devices  [Lin,  1990].  Such  hybrid 
SLMs  are  also  in  the  early  stages  of  advanced  development,  and  are  the  subject  of  current 
intensive  research  and  development  efforts  [ Spatial  Light  Modulators  for  Optical  Informa¬ 
tion  Processing,  1989;  Spatial  Light  Modulators  and  Applications.  1990]. 

Using  either  of  these  two  approaches  to  spatial  light  modulator  fabrication,  devices 
based on both transmissive and reflective readout can be constructed, with different implications
for the overall systems design in each case. In particular, the reflective mode can
be  used  to  advantage  in  configuring  a  hybrid-integrated  SLM  to  mate  the  detection  and 
control  circuitry  functions  of  the  device  with  the  optical  modulation  function.  Use  of  the 
reflective  readout  configuration  allows  the  detection  and  control  circuitry  to  be  integrated 
on  a  substrate  that  is  opaque  to  the  readout  illumination  wavelength  [Kyriakakis,  1990], 
as  shown  schematically  in  Figure  15.6. 


FIGURE  15.6  Example  of  a  hybrid  spatial  light  modulator,  in  which  the 
photodetectors  and  control  electronics  are  fabricated  on  a  silicon  substrate, 
and  the  multiple  quantum  well  modulator  elements  are  fabricated  on  a  gallium 
arsenide  (GaAs)  substrate.  The  two  sets  of  devices  are  bump  contacted  on  a 
pixel-by-pixel  basis  to  provide  parallel  electrical  continuity. 


As an example of the degree of functional integration currently envisioned for spatial
light modulators that are specifically designed for photonic implementations of neural networks,
a silicon-based CMOS chip has recently been designed and fabricated [Asthana,
1990a; Asthana, 1990b] that incorporates two input detectors, control circuitry, and two
(optical modulator) output drivers within each 100 x 100 μm pixel as shown schematically
in Figure 15.7(a). It should be noted that these current dimensions do not in any sense
represent a lower limit, but rather a practical size for laboratory demonstrations and experiments,
as well as a useful size from the perspective of neural network applications. The
pixel layout allows for two 30 x 50 μm detectors, followed by a 15 transistor dual input,
dual output differential amplifier that implements a sigmoid-like transfer function, with
externally programmable saturation characteristics. Output pads are also provided for
hybrid bonding (by bump contact techniques [Shirouzu, 1986]) to an InGaAs/GaAs multiple
quantum well modulator structure fabricated on a GaAs substrate [Kyriakakis, 1990].
Utilizing 2 μm CMOS design rules, the control circuitry easily fits within 25 x 100 μm,
leaving adequate space for the modulator output pads as shown in Figure 15.7(b, c). This
currently allows for the integration of $10^4$ pixels per cm$^2$, or $6 \times 10^4$ pixels per in$^2$. The
functional operation of this circuit will be discussed in the section, “An Implementation
Strategy”, below.
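
The quoted pixel densities follow directly from the 100 μm pixel pitch. The short sketch below reproduces that arithmetic; the function names are illustrative and not part of the original work, and square pixels with no inter-pixel gap are assumed.

```python
# Pixel density for a square pixel pitch (illustrative sketch).
# Assumption: square pixels tiled with no gaps; pitch given in micrometers.
CM_PER_INCH = 2.54


def pixels_per_cm2(pitch_um: float) -> float:
    """Number of square pixels of side `pitch_um` per square centimeter."""
    pixels_per_cm = 1.0e4 / pitch_um  # 1 cm = 10^4 micrometers
    return pixels_per_cm ** 2


def pixels_per_in2(pitch_um: float) -> float:
    """Same density expressed per square inch."""
    return pixels_per_cm2(pitch_um) * CM_PER_INCH ** 2


density_cm2 = pixels_per_cm2(100.0)
density_in2 = pixels_per_in2(100.0)
print(f"{density_cm2:.3g} pixels/cm^2, {density_in2:.3g} pixels/in^2")
```

For a 100 μm pitch this gives $10^4$ pixels per cm$^2$ and roughly $6.5 \times 10^4$ pixels per in$^2$, consistent with the rounded figures in the text.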


FIGURE  15.7  VLSI  layout  of  a  generalizable  silicon-based  spatial  light 
modulator structure: (a) neuron pixel layout; (b) photograph of a single neuron
unit in VLSI implementation, with probe pads substituted for the two
detectors  (bottom)  and  for  contact  to  the  two  modulation  elements  (top);  (c) 
photograph  of  a  6  x  6  array  of  neuron  units  on  a  VLSI  chip  that  incorporates 
additional  test  circuitry. 


Before leaving the subject of photonic switching, it should be noted that the general
principles outlined above can be used to design a wide variety of mutually compatible
devices with different functionalities as well as different tradeoffs among the set of
configurational and operational parameters. For example, it is relatively straightforward to
design time-integrating and time-differentiating circuits; sharp (step-like) thresholds; level
slices; sigmoid-like functions, their complements, and their derivatives; inverters; and
logarithmic amplifiers. Many such functions can be implemented with only a few integrated
components, such as capacitors, diodes, transistors employed as current amplifiers, and
biased transistors employed as resistors [Mead, 1989]. Therefore, these functions can easily
be incorporated within each pixel (neuron unit) of a two-dimensional spatial light modulator,
as well as in some cases between pixels for the implementation of non-local (other
than pointwise) operations such as automatic gain control and nearest-neighbor inhibition.


Photonic  Interconnections 


Given  that  the  neuron  units  are  to  be  represented  by  individual  pixels  within  a  two- 
dimensional  spatial  light  modulator,  interconnections  must  now  be  established  between 
each  individual  neuron  unit  (pixel)  and  many  (if  not  all)  other  neuron  units.  As  such, 
the  chosen  interconnection  scheme  must  be  capable  of  the  appropriate  degree  of  fan-in 
and  fan-out,  be  characterized  by  sufficient  transmission  bandwidth  in  each  channel,  and 
be scalable to relatively large numbers of neuron units. The neural network
paradigm presents the further requirement that the interconnections be weighted, such
that the output from a given point-to-point interconnection is proportional to the product
of the input and a stored constant or weight. It is in fact this requirement that eliminates
from consideration a large number of possible switching networks that provide full
reconfigurability in a non-blocking manner (such as a crossbar or shuffle-exchange network),
but without the capacity to incorporate weights within each interconnection pathway.
In adaptive networks (those that incorporate learning algorithms), these interconnection
weights must have the capability of being updated in a manner determined by the particular
learning algorithm employed. A nontrivial consequence of these last two requirements
is that the interconnection weights must be stored for at least as long as the average
iterative computation, if not much longer; yet, they must simultaneously exhibit dynamic
programmability if the network is to exhibit either short-term or long-term plasticity.

For  very  small  numbers  of  neurons  with  a  low  degree  of  connectivity,  one  possible 
way  of  forming  the  interconnection  network  would  be  to  use  fiber  optic  transmission  lines 
with  modulated  semiconductor  laser  diodes  as  sources  and  optical  receivers  as  detectors, 
much  like  a  fiber  optic  local  area  network.  The  weights  could  be  incorporated  by  means 
of  a  variable  gain  amplifier  at  either  end  of  each  fiber  optic  link,  with  weight  storage  in 
local  dynamic  random  access  memory  (RAM)  or  static  read  only  memory  (ROM)  circuits. 
Unfortunately,  the  sheer  bulk  of  each  transmitter,  receiver,  and  fiber  optic  link  precludes 
scalability  to  large  neural  network  systems.  For  example,  a  fully  connected  twenty  neuron 
network  would  involve  four  hundred  sets  of  sources,  transmission  lines,  and  detectors, 
which  would  currently  represent  a  prohibitive  requirement.  The  same  would  be  true  of  a 
fifty  neuron  network  with  a  fan-out  and  fan-in  of  eight,  representing  a  relatively  low  degree 
of  connectivity. 
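
The link counts quoted above can be checked with a few lines of arithmetic. This is only a counting sketch; the function names are hypothetical, and the fully connected count includes self-connections, as the figure of four hundred for twenty neurons implies.

```python
# Counting discrete source / fiber / detector sets for a fiber optic
# interconnection network (illustrative sketch, hypothetical function names).


def links_fully_connected(n_neurons: int) -> int:
    """Every neuron drives every neuron (self-connections included)."""
    return n_neurons * n_neurons


def links_with_fan_out(n_neurons: int, fan_out: int) -> int:
    """Every neuron drives `fan_out` others; fan-in equals fan-out on average."""
    return n_neurons * fan_out


full_20 = links_fully_connected(20)      # twenty fully connected neurons
sparse_50 = links_with_fan_out(50, 8)    # fifty neurons, fan-out of eight
print(full_20, sparse_50)
```

Both cases require four hundred separate links, which illustrates how quickly the hardware count grows even for small or sparsely connected networks.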

In  order  to  be  able  to  satisfy  the  interconnection  requirements  for  a  large  number  of 
neuron units that are fully or nearly fully interconnected, the appropriate photonic technology
to employ is that of holographic interconnections, in which the weights as well as
the  interconnection  patterns  themselves  are  stored  as  holograms  in  either  a  fixed  (static) 
or  real  time  (dynamic)  holographic  recording  material.  In  this  section,  we  first  discuss  the 
basic  principles  that  apply  to  the  utilization  of  holographic  recording  for  point-to-point 
interconnections.  Next,  we  describe  the  physical  origins  of  a  number  of  complexities  with 
holographic interconnection schemes that lead to both interchannel crosstalk and throughput
losses. An architecture that lends itself to the minimization of such complications
will  be  described  in  detail  in  the  section,  “An  Implementation  Strategy”,  below.  Finally, 
the  potential  for  incorporation  of  real  time  volume  holographic  recording  media  such  as 
photorefractive  materials  in  holographic  interconnection  networks  is  addressed. 

The  essential  principle  of  holographic  recording,  that  of  the  space-variant  interference 
of  two  mutually  coherent  wavefronts,  was  discussed  briefly  in  reference  to  Equation  (10) 
and  is  illustrated  in  Figure  15.2(b).  In  this  figure,  two  angularly  separated  (noncollinear) 
collimated beams are incident on a photosensitive material, such that their mutual interference
locally exposes the material to the intensity distribution given by Equation (10).
In Figure 15.2(b), the photosensitive material was assumed to spatially integrate across
the interference pattern, producing an output that depends on only the spatial average
of the intensity distribution. Suppose now that we use instead a photosensitive material
with the property that its local index of refraction or absorption coefficient depends on the
local incident intensity (exposure), which allows the complete interference pattern to be
recorded. The resulting change in the local optical properties of the medium may either
be immediate (as in the case of a photochromic transformation, for example), or may require
development following exposure (as in the case of bleached photographic negatives
or dichromated gelatin thin films). Figure 15.8(a) shows such a detection or recording
geometry  in  which  a  thin  semi-transparent  layer  of  photosensitive  material  acts  as  a  quasi- 
planar  holographic  recording  medium.  The  interference  pattern  produced  by  the  mutually 
coherent  signal  and  reference  beams  within  the  holographic  recording  medium  is  recorded 
to  form  a  diffraction  grating  within  the  volume  accessed  by  both  beams  simultaneously, 
as  shown  in  the  figure.  For  simplicity  in  Figure  15.8  (as  well  as  in  subsequent  figures), 
we  have  not  shown  the  refraction  of  the  incident  and  transmitted  beams  at  the  input  and 
output  faces  of  the  holographic  recording  medium  that  occurs  due  to  a  difference  between 
the  refractive  indices  of  the  medium  and  its  surround.  The  amplitude  (and  intensity)  of 
the  reflected  beam  shown  in  Figure  15.8(b)  depends  directly  on  the  index  difference,  and 
represents  a  throughput  loss  on  readout. 
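
To give a feel for the size of this reflection loss, the sketch below evaluates the standard Fresnel reflectance of a single interface at normal incidence. Normal incidence, a lossless medium, and the example index values are assumptions made here for illustration; they are not taken from the text.

```python
# Fresnel power reflectance of one interface at normal incidence
# (illustrative sketch; index values below are assumed examples).


def normal_incidence_reflectance(n_medium: float, n_surround: float = 1.0) -> float:
    """Fraction of incident power reflected at a single planar interface."""
    r = (n_medium - n_surround) / (n_medium + n_surround)  # amplitude coefficient
    return r * r


loss_glass_like = normal_incidence_reflectance(1.5)   # assumed index of 1.5
loss_high_index = normal_incidence_reflectance(2.3)   # assumed index of 2.3
print(f"{loss_glass_like:.3f} {loss_high_index:.3f}")
```

The loss vanishes when the indices match and grows rapidly with the index difference, which is why high-index recording media usually call for antireflection coatings or index matching.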


FIGURE  15.8  A  simplified  holographic  recording  configuration:  case  of 
plane wave signal and reference beams, and a thin holographic recording medium;
(a) recording, and (b) reconstruction with a plane wave readout beam.


Consider first the case of an exposure-dependent refractive index variation. Illumination
of such a space-variant modulation of the refractive index by a coherent collimated beam of
the same wavelength $\lambda$ as the exposure (writing) beams will result in a diffraction pattern
consisting of several collimated beams, each emanating in a characteristic direction as
shown schematically in Figure 15.8(b), and as given by the following equation:

$$k_{mx} = k_{rx} + m K_G; \quad |K_G| = \frac{2\pi}{\Lambda_G} = |\mathbf{k}_2 - \mathbf{k}_1|; \quad m = 0, \pm 1, \pm 2, \ldots \qquad (15)$$

In this equation, $k_{mx}$ is the x-component of the wave vector of the mth diffracted beam
(diffraction order), $K_G$ is the wave vector (assumed oriented along the x-axis) of the
interference pattern (diffraction grating) formed by the two writing beams (with wave
vectors $\mathbf{k}_1$ and $\mathbf{k}_2$), $k_{rx}$ is the x-component of the wave vector of the incident readout
beam propagating in the x-z plane, and $\Lambda_G$ is the spatial wavelength of the diffraction
grating. The multiple diffracted orders result from the phase modulation of the readout
(probe) beam by the refractive index modulation $n(x)$ of the thin holographic grating; the
magnitude and phase of the readout beam amplitude immediately after passing through
the hologram (located at the position $z_0$) can be written in the form:

$$A_{\mathrm{diff}} = a_r\, e^{i(k_{rx} x + k_{rz} z_0)}\, e^{i\phi_G(x)}, \qquad (16)$$

in which $A_{\mathrm{diff}}$ is the amplitude of the diffracted wavefront, $\phi_G(x) = 2\pi n(x) d/\lambda$ is the local
phase shift induced by the diffraction grating (assumed to be of thickness $d$), $a_r e^{i(k_{rx} x + k_{rz} z_0)}$
is the incident readout beam amplitude at the exit plane of the hologram ($z_0$), and $k_{rx}$ and
$k_{rz}$ are the x and z components of the wave vector $\mathbf{k}_r$, respectively. Each of the diffracted
orders can then be directly associated with a corresponding Fourier component of the
modulated amplitude [Goodman, 1968], which can be expanded in terms of the form:

$$A_m e^{i m K_G x}. \qquad (17)$$

In order to assess the effectiveness with which the holographic grating diffracts the incident
beam into a particular diffraction order, it is convenient to define the diffraction efficiency
$\eta$ of each order as:

$$\eta = \frac{|A_m|^2}{|a_r|^2}. \qquad (18)$$
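
For the common special case of a sinusoidal index modulation, the step from Equation (16) to Equations (17) and (18) can be made explicit. The sketch below assumes $n(x) = n_0 + \Delta n \cos(K_G x)$ and introduces the shorthand $\phi_0 = 2\pi n_0 d/\lambda$ and $\nu = 2\pi\,\Delta n\, d/\lambda$; these symbols are illustrative assumptions, and the expansion used is the standard Jacobi-Anger identity.

```latex
\begin{aligned}
\phi_G(x) &= \phi_0 + \nu \cos(K_G x), \\
e^{i\nu\cos(K_G x)} &= \sum_{m=-\infty}^{\infty} i^{m} J_m(\nu)\, e^{i m K_G x}, \\
A_{\mathrm{diff}} &= a_r\, e^{i(k_{rx} x + k_{rz} z_0 + \phi_0)}
                     \sum_{m=-\infty}^{\infty} i^{m} J_m(\nu)\, e^{i m K_G x}, \\
|A_m| &= |a_r|\,|J_m(\nu)| \quad\Longrightarrow\quad \eta_m = J_m^{2}(\nu).
\end{aligned}
```

Each term of the sum carries the transverse wave vector $k_{rx} + m K_G$, in agreement with Equation (15), and under these assumptions the efficiency of the mth order is the square of a Bessel function of the first kind.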


The essential diffraction properties of thin absorption gratings (in which the modulation
occurs in the local absorption coefficient) are the same as for the case of thin pure phase
gratings, with two principal exceptions: (1) for sinusoidal absorption gratings, the diffracted
orders are limited to m = -1, 0, and 1; and (2) the presence of absorption significantly
decreases the maximum first order diffraction efficiency that can be achieved.
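
The directions of the diffracted orders of a thin grating follow from Equation (15). Writing each x-component as $(2\pi/\lambda)\sin\theta$ gives the familiar grating equation $\sin\theta_m = \sin\theta_r + m\lambda/\Lambda_G$. The sketch below evaluates it; the function name and the numerical values are illustrative assumptions, and refraction at the faces of the medium is ignored, as in Figure 15.8.

```python
import math

# Directions of thin-grating diffraction orders from the grating equation
# sin(theta_m) = sin(theta_r) + m * wavelength / period  (sketch only).


def diffracted_angles(wavelength, grating_period, readout_angle_deg, max_order=3):
    """Return {m: angle in degrees} for the propagating orders |m| <= max_order.

    Wavelength and period must share the same length unit.
    """
    sin_in = math.sin(math.radians(readout_angle_deg))
    angles = {}
    for m in range(-max_order, max_order + 1):
        sin_out = sin_in + m * wavelength / grating_period
        if abs(sin_out) <= 1.0:  # orders with |sin| > 1 are evanescent
            angles[m] = math.degrees(math.asin(sin_out))
    return angles


# Assumed example: 0.5 um light at normal incidence on a 2 um period grating.
orders = diffracted_angles(wavelength=0.5, grating_period=2.0, readout_angle_deg=0.0)
for m, angle in sorted(orders.items()):
    print(m, round(angle, 2))
```

For a sinusoidal absorption grating only the m = -1, 0, and 1 entries would carry light, although the directions of those orders are unchanged.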

In order to illustrate how such holographic gratings can be employed to generate weighted
point-to-point interconnections, we need to introduce two additional concepts: the lens
as an angle-to-position encoder, and the superposition of holographic gratings recorded
with different diffraction efficiencies. The first concept can be understood with reference
to Figure 15.9, in which a simple lens is placed one focal length away from a point source in
the input plane of a photonic interconnection, and a second simple lens is placed one focal
length away from the output plane. What is normally thought of as the focal property
of a lens results in the generation of a collimated beam (a beam comprising both parallel
rays and planar wavefronts) following the first lens, with an angle (both in and out of the
plane of the page) that depends on the position of the point source in the focal (input)
plane. In this sense, the first lens acts as a position-to-angle encoder, providing a
one-to-one correspondence between the input location and the output collimated beam angle.
Depending on the nature of the grating stored within the holographic optical element,
the collimated beam will be diffracted into a new direction characterized by a different
angle. The second lens will then focus the diffracted beam to a point in the output plane
that depends on this angle, thus acting as an angle-to-position encoder. The utilization of
different orientations of gratings within the holographic optical element allows for the
interconnection of any arbitrary point in the input plane to any other point in the output plane.
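
The position-to-angle and angle-to-position encoding can be sketched in one transverse dimension as follows. This is a simplified geometrical model under stated assumptions: ideal thin lenses of equal focal length, a thin grating obeying Equation (15), and a sign convention chosen so that an undeflected beam maps a position onto itself (a real two-lens system would invert the image). The function names and numerical values are hypothetical.

```python
import math

# One-dimensional sketch of lens / hologram / lens routing (assumptions above).


def position_to_angle(x, focal_length):
    """First lens: source position in the focal plane -> collimated beam angle."""
    return math.atan2(x, focal_length)


def angle_to_position(theta, focal_length):
    """Second lens: collimated beam angle -> focused spot position."""
    return focal_length * math.tan(theta)


def route(x_in, focal_length, wavelength, grating_period, order=1):
    """Output position reached from x_in via one diffraction order of one grating."""
    theta_in = position_to_angle(x_in, focal_length)
    sin_out = math.sin(theta_in) + order * wavelength / grating_period
    if abs(sin_out) >= 1.0:
        raise ValueError("requested diffraction order does not propagate")
    return angle_to_position(math.asin(sin_out), focal_length)


# Assumed example, lengths in mm: f = 100, wavelength = 0.5e-3, period = 0.01.
x_out = route(0.0, 100.0, 0.5e-3, 0.01)
x_straight = route(2.0, 100.0, 0.5e-3, 0.01, order=0)
print(round(x_out, 4), round(x_straight, 4))
```

Choosing a different grating period (or, in two dimensions, a different grating orientation) moves the output spot, which is the routing freedom described in the text.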


FIGURE  15.9  A  point-to-point  interconnection  system,  using  a  holographic 
optical  element  (HOE)  for  interconnection  routing,  and  lenses  as  position-to- 
angle  and  angle-to-position  encoders.  In  this  example,  the  holographic  optical 
element  effectively  performs  an  input  angle  to  output  angle  transformation, 
such that light emitted (or transmitted) at point $p_1$ in the input plane ($P_1$) is
detected at point $p_2$ in the output plane ($P_2$).


Suppose now that we do in fact choose to superimpose a number of planar gratings
within the holographic medium, each with a different wave vector (orientation and grating
period) and grating modulation (variation of the refractive index or the absorption
coefficient). Assuming for the moment that the diffraction process is linear, each input
point will be interconnected with a number of output points as determined by the set
of recorded gratings. Likewise, each output point will be interconnected with a specified
number of input points. Each interconnection will be weighted by its diffraction efficiency
as determined by Equation (18), which is in turn dependent on the index of refraction (or
absorption coefficient) variation recorded for each grating. As such, the holographic optical
element acts as a multi-port variable beamsplitter, redirecting (diffracting) a given fraction
of each input beam to a specified set of output beams. By employing lenses as described
above, this feature allows the construction of a point-to-point interconnection with weights
and arbitrary fan-out/fan-in (limited only by the number of gratings recorded).
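
Viewed abstractly, the multi-port variable beamsplitter performs a matrix-vector product, with one diffraction efficiency per recorded grating. The sketch below assumes linear diffraction and intensity (incoherent) summation at each output; the weight values and names are invented for illustration only.

```python
# Weighted interconnection as a matrix-vector product (illustrative sketch).
# weights[j][i] = diffraction efficiency of the grating linking input i to output j.


def interconnect(weights, inputs):
    """Return the output intensities sum_i weights[j][i] * inputs[i]."""
    return [sum(w * x for w, x in zip(row, inputs)) for row in weights]


def diffracted_fraction(weights, i):
    """Total fraction of input i redirected to all outputs (cannot exceed 1)."""
    return sum(row[i] for row in weights)


weights = [
    [0.10, 0.00, 0.05],   # output 1 receives from inputs 1 and 3
    [0.00, 0.20, 0.05],   # output 2 receives from inputs 2 and 3
]
inputs = [1.0, 2.0, 4.0]
outputs = interconnect(weights, inputs)
print([round(v, 3) for v in outputs])
```

A zero entry corresponds to a grating that was never recorded, so the fan-out of an input is the number of nonzero entries in its column and the fan-in of an output is the number of nonzero entries in its row.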

There  is  at  least  one  obvious  problem  with  the  interconnection  scheme  outlined  above, 
however,  in  that  any  given  grating  will  connect  any  of  the  input  points  to  specific  output 
points  pairwise,  as  shown  by  Equation  (15).  This  particular  feature  occurs  because  each 
input  point  generates  a  collimated  beam  with  a  distinct  wave  vector  kr  corresponding  to 
a  particular  direction  (angle)  of  propagation,  each  of  which  satisfies  Equation  (15)  with  a 
different  diffracted  wave  vector  (for  each  diffracted  order)  km.  The  result  of  this  degeneracy 
is  that  any  recorded  hologram  that  is  designed  to  connect  a  single  input  point  to  one  or 
more  output  points  will  in  fact  also  connect  every  other  input  point  to  corresponding  sets 
of  output  points,  using  the  same  relative  interconnection  pattern  for  each  input  point. 
This  effect  can  be  utilized  to  advantage,  for  example,  in  parallel  digital  optical  computing 
systems  with  interconnection  symmetry  or  regularity,  since  one  simple  hologram  can  in 
effect  implement  a  very  large  number  of  point-to-point  interconnections  (the  equivalent  of 
wires in the case of an electronic implementation) [Jenkins, 1984]. For neural networks,
however, the common requirement of nearly arbitrary (highly irregular) interconnections
makes this feature undesirable.

A second problem with the proposed interconnection scheme is the presence of a multiplicity
of diffracted orders for each diffraction grating, as shown in Figure 15.8, which
occasions the connection of each input point to a number of geometrically related output
points even for the case of a single stored grating.

The  solution  to  this  seeming  dilemma  is  to  extend  the  holographic  medium  into  the 
third  dimension  (the  direction  of  light  propagation),  creating  a  volume  holographic  optical 
element  (VHOE)  to  take  the  place  of  the  thin  planar  element  discussed  above.  There  are 
two  essential  properties  of  VHOEs  that  bear  directly  on  the  utilization  of  such  elements  in 
photonic  interconnections.  The  first  is  that  diffraction  is  limited  to  the  first  order  only  and 
all  higher  diffracted  orders  are  suppressed  if  the  holographic  medium  is  thick  enough,  as 
defined below and as shown schematically in Figure 15.10. This occurs because each additional
“layer” in the thickness direction of the holographic medium provides an additional
constraint  on  the  diffraction  phenomenon;  these  constraints  act  collectively  to  enhance 
the  amplitude  diffracted  into  the  first  order  by  means  of  constructive  interference,  at  the 
expense  of  the  other  diffracted  orders. 


FIGURE  15.10  Volume  holographic  recording  with  plane  wave  signal  and 
reference  beams;  (a)  recording,  and  (b)  reconstruction,  showing  the  elimination 
of  the  higher  diffracted  orders. 


The  second  important  property  of  a  volume  holographic  optical  element  is  that  of 
angular selectivity; specifically, the range of input angles that can diffract from a given
grating  decreases  as  the  thickness  of  the  grating  is  increased.  The  central  angles  that  are 
allowed  in  the  case  of  a  thick  grating  are  the  same  angles  that  define  the  two  beams  that 
initially  created  the  holographic  grating.  This  property  therefore  eliminates  the  inadvertent 
connection  of  all  input  points  pairwise  to  a  matching  set  of  output  points,  and  allows  for 
the  generation  of  independent,  weighted  interconnections  as  are  desired  for  neural  network 
applications. 


In  order  to  differentiate  “thin”  grating  diffraction  behavior  (the  so-called  Raman-Nath 
diffraction  regime)  from  “thick”  grating  behavior  (the  so-called  Bragg  diffraction  regime), 
it is convenient to define a dimensionless “thickness” parameter Q such that:

$$Q = \frac{2\pi\lambda d}{n \Lambda_G^2}, \qquad (19)$$

in which n is the average refractive index of the holographic recording medium, and the
remaining parameters are as specified previously. In general, gratings for which Q > 10
operate well within the Bragg regime, while gratings with Q parameters less than unity
exhibit unacceptable degrees of Raman-Nath character for truly independent multiplexed
interconnection applications. The angular response characteristics of both planar and volume
diffraction gratings are shown as a function of the Q parameter in Figure 15.11, in
which the transition from pure Raman-Nath to pure Bragg behavior for increasing values
of Q can be seen. Note that the number of input points that can be independently connected
to an equally-sized array of output points is a decreasing function of the width of
the angular response.
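
As a rough numerical guide, the sketch below evaluates the thickness parameter and applies the Q > 10 and Q < 1 guidelines given above. It assumes the widely used form $Q = 2\pi\lambda d/(n\Lambda_G^2)$, which may differ in detail from the definition intended in the text; the function names and example values are likewise illustrative assumptions.

```python
import math

# Thickness parameter and diffraction regime (illustrative sketch).
# Assumed definition: Q = 2*pi*wavelength*thickness / (mean_index * period**2).


def q_parameter(wavelength, thickness, mean_index, grating_period):
    """Dimensionless Q; all lengths must share the same unit."""
    return 2.0 * math.pi * wavelength * thickness / (mean_index * grating_period ** 2)


def regime(q):
    """Classify using the guideline values quoted in the text."""
    if q > 10.0:
        return "Bragg"
    if q < 1.0:
        return "Raman-Nath"
    return "intermediate"


# Assumed example values, lengths in micrometers.
q_thick = q_parameter(0.5, 1000.0, 2.3, 2.0)   # 1 mm thick medium
q_thin = q_parameter(0.5, 1.0, 2.3, 2.0)       # 1 um thick film
print(round(q_thick, 1), regime(q_thick), round(q_thin, 3), regime(q_thin))
```

Because Q scales linearly with thickness and inversely with the square of the grating period, fine gratings in millimeter-thick media fall deep in the Bragg regime, while the same gratings in micrometer-thick films do not.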


FIGURE  15.11  The  angular  alignment  sensitivity  of  a  volume  holographic 
optical  element,  as  a  function  of  the  dimensionless  Q-parameter  defined  in  the 
text.  The  grating  strength  for  all  of  the  curves  (3.14  radians)  is  optimized  to 
produce  100%  diffraction  efficiency  in  the  limit  of  large  Q  (Bragg  diffraction 
regime),  and  is  not  optimized  for  low  Q  gratings.  Note  that  the  diffraction 
efficiency  is  essentially  independent  of  angle  for  low  Q  gratings,  and  is  very 
strongly peaked at the Bragg angle (7.5 degrees in this case) for high Q gratings.

The throughput efficiency of a volume holographic optical element as used in an
interconnection application is determined to first order by the diffraction efficiency of each
individual interconnection grating, in direct analogy to the definition of the diffraction
efficiency for the planar hologram case in Equation (18). For example, for the case of an
unslanted pure phase grating with equiphase fronts (i.e., planes of constant phase) parallel
to the bisector of the recording beams with wave vector magnitude $k$ and perpendicular to the
entrance face of the volume holographic recording medium, the diffraction efficiency at the
Bragg (optimum readout) angle is given by [Kogelnik, 1969]:

$$\eta = e^{-\alpha d/\cos\theta_B}\, \sin^2\!\left(\frac{\pi\,\Delta n\, d}{\lambda \cos\theta_B}\right), \qquad (20)$$

in which $\alpha$ is the absorption coefficient of the holographic recording medium of thickness
$d$ at the optical readout wavelength $\lambda$, $\Delta n$ is the amplitude of the refractive index
modulation, and $\theta_B$ is the Bragg angle defined by $2k \sin\theta_B = K_G$. As can be seen from Equation
(20), the diffraction efficiency of the first order for a single grating can approach 100% if
the absorption coefficient satisfies the requirement $\alpha d \ll 1$, provided sufficient index
modulation $\Delta n$ can be produced by the exposure process. The dependence of the diffraction
efficiency on the grating strength is shown in Figure 15.12 for both thin (Raman-Nath)
and thick (Bragg) pure phase diffraction gratings. The grating strength $\nu$ is defined as the
integrated peak phase modulation of the grating in each case, and is given by:

$$\nu = \frac{2\pi\,\Delta n\, d}{\lambda \cos\theta_B}. \qquad (21)$$

The maximum diffraction efficiency of the thin diffraction grating is about 34%, which
occurs at a grating strength of 1.8 radians. Thick diffraction gratings achieve 100% diffraction
efficiency at a grating strength of $\pi$ radians, at which point the diffraction efficiency
of the thin grating has peaked and is nearly at its first node, as shown in Figure 15.12.
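
The quoted peak values can be reproduced numerically under two standard textbook assumptions that are not spelled out above: the first-order efficiency of a thin sinusoidal phase grating is $J_1^2(\nu)$, and that of a lossless, Bragg-matched thick phase grating is $\sin^2(\nu/2)$. The sketch below uses only the standard library and evaluates the Bessel function from its integral representation; all names are illustrative.

```python
import math

# Thin (Raman-Nath) versus thick (Bragg) first-order efficiency (sketch only).


def bessel_j(order, x, steps=200):
    """J_n(x) = (1/pi) * integral_0^pi cos(n*t - x*sin(t)) dt, midpoint rule."""
    h = math.pi / steps
    total = 0.0
    for k in range(steps):
        t = (k + 0.5) * h
        total += math.cos(order * t - x * math.sin(t))
    return total * h / math.pi


def eta_thin(nu):
    """Assumed thin sinusoidal phase grating: first-order efficiency J_1(nu)^2."""
    return bessel_j(1, nu) ** 2


def eta_thick(nu):
    """Assumed lossless Bragg-matched thick phase grating: sin^2(nu / 2)."""
    return math.sin(nu / 2.0) ** 2


grid = [i * 0.01 for i in range(1, 401)]      # grating strengths 0.01 ... 4.00 rad
nu_peak = max(grid, key=eta_thin)             # location of the thin-grating peak
thin_peak = eta_thin(nu_peak)
thick_at_pi = eta_thick(math.pi)
thin_at_pi = eta_thin(math.pi)
print(round(nu_peak, 2), round(thin_peak, 3), round(thick_at_pi, 3), round(thin_at_pi, 3))
```

Under these assumptions the thin grating peaks near 34% at a grating strength of about 1.8 radians, while at $\pi$ radians the thick grating reaches unity and the thin grating has already fallen below 10%.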


FIGURE  15.12  The  diffraction  efficiency  of  thin  (Raman-Nath  diffraction 
regime)  and  thick  (Bragg  diffraction  regime)  holographic  gratings  as  a  function 
of  the  grating  strength. 


The extremely narrow angular alignment characteristics of volume diffraction gratings
in principle allow large numbers of independent, weighted interconnections between the
input plane and the output plane to be multiplexed within a single recording medium (cf.
Figure 15.9). In addition, the use of angular multiplexing allows for both fan-out from a given
input point to a number of output points, as well as fan-in from a number of input points
to a single output point.

The holographic implementation of the fan-out from a single input point to a number
of output points uses several multiplexed (superimposed) holographic gratings to achieve
the desired weighted fan-out, one for each output point. Consider a 4 input, 4 output
interconnection as shown in Figure 15.13. For each input point $x_i$ that we wish to interconnect
to an output point $y'_j$, the recording process requires the pairwise coherent interference
within the holographic recording medium of $x_i$ with a second beam $y_j$ corresponding to
$y'_j$. The interconnection of $x_1$ to $y'_1$, $y'_2$, $y'_3$, and $y'_4$ therefore requires the pairwise coherent
interference of $x_1$ with $y_1$, $x_1$ with $y_2$, and so on. This process results in the fourfold fan-out
of $x_1$ to all of the outputs.

The fan-out from a single reference beam to a number of output beams is directly
analogous to the readout of a traditional hologram (of, for example, a two-dimensional or
three-dimensional image), provided that the full set of beams {$y_j$} is coherently recorded
with the given reference beam $x_i$. Although up to this point we have formulated the
point-to-point holographic interconnection problem in terms of collimated (plane wave) input
and output beams that record individual diffraction gratings (characterized by a single
grating wave vector) within the holographic recording medium, many alternative recording
and reconstruction geometries can be envisioned that produce equivalent results. In the
case of traditional holography, for example, the input transparency bearing the image to be
recorded is illuminated with a collimated beam, resulting in a complex diffraction pattern
at the front entrance plane of the holographic recording medium. Collimated, converging,
or diverging reference beams can be utilized to produce reconstructed images with a wide
variety of optical imaging characteristics. Likewise, various input and output beam geometries
can be used in a point-to-point interconnection system to optimize the overall system
characteristics, such as freedom from interchannel crosstalk, optimum use of the spatial
frequency recording characteristics of the holographic recording medium, optical system
complexity, and convenience of the optical layout (particularly when viewed in conjunction
with associated optical subsystems).


FIGURE 15.13 Schematic representation of a 4 input, 4 output holographic
interconnection, showing 4 coherent input beams $x_1$-$x_4$ and 4 coherent recording
beams $y_1$-$y_4$, each of which corresponds to a desired output $y'_1$-$y'_4$. In (a), the
sets {$x_i$} and {$y_j$} interfere within the volume holographic medium, recording
the desired interconnection diffraction gratings. In (b), a new set of input beams
{$x_i$} illuminates the volume holographic medium, reading out the weighted
interconnection pattern and forming appropriately weighted sums at each of
the outputs {$y'_j$}.


The fourfold fan-in of inputs $x_1$, $x_2$, $x_3$, and $x_4$ to $y'_1$ can likewise be accomplished by
recording each of the necessary interconnections pairwise, as before for the fan-out case.
The recording process for the fully implemented 4 to 4 interconnection therefore involves
the generation of 16 individually weighted diffraction gratings that connect the full set of
inputs {$x_i$} to the full set of outputs {$y'_j$}. The multiplexed hologram that accomplishes
this function can be recorded in a number of ways, each characterized by certain advantages
and disadvantages [Psaltis, 1988].

In the fully coherent approach, the requisite gratings can be recorded by illuminating
the holographic recording medium with {x_j} and with {y_i} simultaneously. This can be
accomplished, for example, by using a spatial light modulator to store each of the sets of
values, and a pair of mutually coherent readout beams to encode these values and interfere
them within the holographic element. In this manner, all of the required gratings are
recorded in a single exposure; however, there are two difficulties inherent in this single
exposure, fully coherent approach. The first problem is that a fully independent N to M
interconnection requires NM stored interconnection weights, whereas the single exposure
described above supplies only N + M input values that can be used to generate the weights.
The resulting interconnection matrix can in fact connect all of the input points to all of the
output points, but the relative fan-out weights from each input point will be degenerate.
One way to avoid this degeneracy is to illuminate the holographic recording medium with
each input x_j and a full set of corresponding outputs {y_i}, sequencing through all N of
the inputs (and changing the set of corresponding outputs) one at a time. This procedure
generates an independent fan-out from each input point. The second problem with the
single exposure, fully coherent approach is that undesirable gratings will be recorded among
the {x_j} and among the {y_i} that can lead to considerable coherent crosstalk among the
desired interconnection pairs. This coherent interference process diminishes the degree of
independence of the interconnections.
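
The degeneracy of the single exposure can be illustrated numerically. The following is a minimal sketch, assuming (as a simplification not stated in the text) that each recorded grating strength is simply proportional to the product of the two beam amplitudes that write it; the function names and the amplitude values are hypothetical.

```python
def single_exposure_weights(x, y):
    """Weights recorded when all inputs x_j and recording beams y_i are on at once.

    Assumption: grating strength w[i][j] is proportional to y_i * x_j.
    """
    return [[yi * xj for xj in x] for yi in y]


def fanout_ratios(weights, j):
    """Fan-out weights from input j, normalized to the first output."""
    column = [row[j] for row in weights]
    return [w / column[0] for w in column]


x = [1.0, 0.5, 2.0, 0.25]  # N = 4 input amplitudes (illustrative values)
y = [0.8, 0.4, 0.2, 0.1]   # M = 4 recording beam amplitudes (illustrative values)
w = single_exposure_weights(x, y)

# N + M = 8 numbers generate all N * M = 16 weights, so every input
# ends up with the same relative fan-out pattern.
reference = fanout_ratios(w, 0)
for j in range(len(x)):
    ratios = fanout_ratios(w, j)
    same = all(abs(a - b) < 1e-12 for a, b in zip(ratios, reference))
    print(j, ratios, same)
```

Under this assumption the weight matrix is an outer product, so the fan-out from every input is a scaled copy of the same pattern, which is the degeneracy described above.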

This coherent-recording-induced crosstalk can also be avoided by sequencing the recording,
but in this case each desired grating pair is recorded separately such that only one
input beam x_j interferes with one output beam generator y_i (recording beam for the desired
output beam y'_i) at a time. This scheme effectively eliminates the coherent crosstalk, but
does not eliminate another form of crosstalk (called beam degeneracy crosstalk [Jenkins,
1990a; Jenkins, 1990b; Asthana, 1990c], the origin of which is described below) that can
be equally severe; in addition, the complication imposed by the incorporation of such a
sequential recording schedule can be a serious constraint for large N x M (N input points
to M output points) interconnections, as NM independent recording steps are required
for full programming of the interconnection. This proves to be particularly problematic
for the rapid generation of weight updates in a large scale neural interconnection network
that incorporates synaptic plasticity. Furthermore, sequential recording of holographic
exposures can cause partial erasure of previously recorded interconnection weights in certain
types of holographic recording materials, necessitating the use of recording schedules
that attempt to balance the weights recorded at the beginning of the sequence (and hence
partially erased by all subsequent exposures) with the weights recorded at the end of the
sequence [Psaltis, 1988]. The use of such recording schedules usually implies an overall
decrease in both the exposure efficiency and throughput efficiency of the resulting holographically
recorded interconnection matrix, as well as the buildup of noise resulting from
the series of space-variant erasures.
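
To make the idea of a balancing recording schedule concrete, here is a minimal sketch under an assumed exponential model that is not taken from the text: a grating written for time t reaches 1 - exp(-t/tau) of saturation, and every later exposure of duration t multiplies it by exp(-t/tau). The function names, the single time constant tau, and the parameter `fraction` are all hypothetical.

```python
import math


def exposure_schedule(num_gratings, tau, fraction=0.9):
    """Exposure times t_1..t_M that equalize the final grating amplitudes.

    Each grating ends at f = fraction / M of saturation in the assumed model;
    fraction must be below 1 for the first exposure time to be finite.
    """
    f = fraction / num_gratings
    times = []
    survive = 1.0  # decay factor a grating suffers from all later exposures
    for _ in range(num_gratings):  # work backwards from the last exposure
        u = 1.0 - f / survive      # u = exp(-t / tau) for this exposure
        times.append(-tau * math.log(u))
        survive *= u
    return times[::-1]


def final_amplitudes(times, tau):
    """Replay the schedule forwards and return the surviving amplitudes."""
    amps = []
    for t in times:
        decay = math.exp(-t / tau)
        amps = [a * decay for a in amps]  # partial erasure of earlier gratings
        amps.append(1.0 - decay)          # newly written grating
    return amps


schedule = exposure_schedule(16, tau=1.0)
print([round(t, 4) for t in schedule])
print([round(a, 4) for a in final_amplitudes(schedule, tau=1.0)])
```

In this model the early exposures are the longest, and the equalized amplitude per grating is bounded by 1/M of saturation, which illustrates why such schedules reduce the exposure and throughput efficiency.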

One potential scheme for reducing coherent-recording-induced crosstalk, beam degeneracy
crosstalk, and sequential recording schedules involves the use of an array of individually
coherent but mutually incoherent sources to simultaneously expose the holographic recording
medium to only the desired sets of gratings [Jenkins, 1990a; Jenkins, 1990b; Asthana, 1990c].
This scheme will be discussed in detail in a later section.

The fan-out process is illustrated in Figure 15.14, in which implementations using both
beamsplitters and volume holographic optical elements are shown. The case of fan-out
utilizing beamsplitters is shown schematically in Figure 15.14(a). As can be seen in the
figure, the input beam can be divided among the output beams with arbitrary weights set
by the transmissivities of the beamsplitting elements BS_i. If the final beamsplitter is a
mirror, the fan-out process can be accomplished with essentially zero throughput loss. By
analogy to the beamsplitter case, as well as by direct analysis, it can be proven that the
holographic fan-out process shown in Figure 15.14(b) can also be accomplished with essentially
arbitrary weights, with no optical throughput loss inherent in the fan-out process
itself. It is interesting to note that these two implementations differ in at least one essential
feature, in that the beams fanned out from the holographic implementation originate
within the same volume, while the beams fanned out from the beamsplitter implementation
originate from vertically displaced beamsplitters. If we were to extend the fanned out
beams in the latter case backwards toward the left hand side of Fig. 15.14(a), we could
imagine replacing the three discrete, vertically displaced beamsplitters with a single, multiplexed
“virtual” beamsplitter that generates the same set of output beams. One physical
realization of such a “virtual” beamsplitter component is in fact the multiplexed volume
hologram shown in Figure 15.14(b).
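
The lossless beamsplitter fan-out can be sketched as a short calculation. The example assumes ideal, lossless beamsplitters in which element k reflects a fraction R_k of the power reaching it into output k and transmits the remainder to the next element; whether an output is taken in reflection or in transmission depends on the actual layout of Figure 15.14(a), and the function names are hypothetical.

```python
def fanout_reflectivities(weights):
    """Reflectivity of each beamsplitter for the requested (positive) weights."""
    total = sum(weights)
    remaining = 1.0  # fraction of the input power still travelling down the cascade
    reflectivities = []
    for w in weights:
        target = w / total  # normalized power wanted at this output
        reflectivities.append(target / remaining)
        remaining -= target
    return reflectivities


def cascade_outputs(reflectivities):
    """Power delivered to each output by a cascade with these reflectivities."""
    remaining = 1.0
    outputs = []
    for r in reflectivities:
        outputs.append(remaining * r)
        remaining *= 1.0 - r
    return outputs


r = fanout_reflectivities([0.5, 0.3, 0.2])
print(r)                   # the last element comes out as a mirror (R close to 1)
print(cascade_outputs(r))  # reproduces the requested weights
print(sum(cascade_outputs(r)))
```

Because the final element reflects everything that reaches it, the output powers sum to the input power, consistent with the statement that fan-out carries essentially no intrinsic throughput loss.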


FIGURE 15.14 Schematic representation of the fan-out process for optical
beams, for the case of one input and three outputs: (a) with beamsplitters
(BS_1 - BS_3); (b) with a single holographic optical element containing three
multiplexed (spatially superimposed) diffraction gratings.


The collinear fan-in process is illustrated in Figure 15.15 for both types of implementations.
As was discussed above, for the beamsplitter implementation an intrinsic fan-in
loss is encountered for the case of collinear fan-in, while the intrinsic loss can be circumvented
by resorting to mirrors and employing angular multiplexing. For the case of volume
holographic optical elements, the situation is identical, such that collinear fan-in is grossly
inefficient for large numbers of fan-in interconnections to the same node. On the other
hand, appropriate use of angular multiplexing can eliminate this seemingly inherent fan-in
loss, giving rise to a highly multiplexed, efficient interconnection element [Jenkins, 1990a;
Jenkins, 1990b; Asthana, 1990c] as described in a later section.
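
The intrinsic loss of collinear fan-in with beamsplitters can be illustrated with a short calculation. This is a sketch under stated assumptions: the beamsplitters are lossless, the input beams add in power (no fixed phase relationship is exploited), and beam k is reflected into the common output direction by element k and then transmitted by every later element. The function names are hypothetical.

```python
def fanin_throughputs(reflectivities):
    """Fraction of each input beam's power that reaches the collinear output."""
    n = len(reflectivities)
    result = []
    for k in range(n):
        t = reflectivities[k]            # reflection into the common direction
        for m in range(k + 1, n):
            t *= 1.0 - reflectivities[m]  # transmission through later elements
        result.append(t)
    return result


def equal_fanin_reflectivities(n):
    """Reflectivities 1, 1/2, ..., 1/n: the first element is a mirror."""
    return [1.0 / (k + 1) for k in range(n)]


for n in (3, 10, 100):
    throughputs = fanin_throughputs(equal_fanin_reflectivities(n))
    print(n, min(throughputs), max(throughputs))
```

With equal weighting, each of the N inputs retains only 1/N of its power in the combined beam; the remainder leaves through the unused beamsplitter ports.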


FIGURE 15.15 Schematic representation of the fan-in process for optical
beams, for the case of three angularly distinct inputs and one combined collinear
output beam: (a) with beamsplitters, showing the unavoidability of a throughput
loss associated with the set of transmitted (and multiply reflected) beams;
(b) with a single holographic optical element containing three multiplexed
(spatially superimposed) diffraction gratings, showing an analogous throughput
loss.


The  physical  origin  of  this  intrinsic  optical  throughput  loss  in  the  case  of  collinear 
fan-in  is  directly  related  to  the  mechanism  that  gives  rise  to  beam  degeneracy  crosstalk. 
In  Figure  15.16  we  show  a  4  to  4  holographic  interconnection  that  is  assumed  to  have 
been  recorded  by  the  sequential  exposure  technique  described  above  in  reference  to  Figure 
15.13,  in  order  to  include  all  16  individually  weighted  interconnection  gratings  but  none 
of the undesirable gratings that can give rise to coherent-recording-induced crosstalk. In
this case, readout by the input beam x_1 generates the four output beams y'_1 through y'_4,
with values given by the stored interconnection weights w_ij:

$$ y'_i = w_{i1}\, x_1 \qquad (22) $$

Within the holographic medium, however, each of the four output beams can in turn
act as an input beam, generating undesired output beams in the directions x'_2, x'_3, and
x'_4. These undesired output beams are a result of diffraction from the gratings recorded
between each output generating beam y_i and the full set of input beams {x_j}. Each output
beam is automatically Bragg matched (at the correct Bragg angle) to the full set of input
beams due to the collinear recording geometry employed. We refer to the fan-in as collinear
in this case because each input beam x_j that is fanned in to a given output y'_i produces an
output beam in the same direction. The generation of diffracted intensity in the directions
x'_2 through x'_4 from readout with x_1 results in a throughput loss for the interconnections
between x_1 and the set of output beams {y'_i}. In addition, the throughput losses of the
individual output beams {y'_i} will not be equal in general. Furthermore, the undesired diffracted
beams x'_2 through x'_4 can also act as input beams, generating additional output beams in the
directions {y'_i} that coherently interfere with the beams directly diffracted in those directions
by the input beam x_1. The combination of interconnection-dependent losses from the output
beams {y'_i} into the “cross-coupled” beams {x'_j}, and of interconnection-dependent
coupling from {x'_j} into {y'_i} gives rise to an undesired redistribution of the intensities of
the output beams. This phenomenon is referred to as beam degeneracy crosstalk, as it
arises from the beam direction degeneracy (collinearity) of the output beams fanned into
a single output point.


FIGURE 15.16 Illustration of the generation of crosstalk in holographic
optical interconnections due to beam degeneracy: recording/readout configuration.
The input beams {x_j} are assumed to have interfered within the
volume holographic medium with the set of recording beams {y_i}, producing
the desired set of interconnection gratings with weights w_ij. Illumination of
the volume holographic medium with beam x_1 produces a 1 to 4 fanout into
the output beams {y'_i}, as well as the zeroth order beam x'_1. Due to the effects
of beam degeneracy, power is also coupled into the zeroth order beams x'_2 through x'_4,
and crosstalk terms {c_i} are introduced into the outputs.


Both the throughput loss and the beam degeneracy crosstalk that characterize holographic
interconnection geometries with collinear fan-in can be estimated by numerical
simulation of the diffraction process from a multiplexed grating [Asthana, 1990c]. By
using the optical beam propagation method [Johnson, 1986] to simulate the diffraction
process, we can analyze the 4 to 4 interconnection described above for the case of a single
beam readout, as shown in Figure 15.16. The results of such an analysis are presented in
Figure 15.17, which shows the diffraction efficiency of each of the four beams fanned out
from the single input point, as well as the three cross-coupled beams in the directions {x'_j}
and the zero order (undiffracted) beam. For this illustration, all 16 interconnection weights
are equal in magnitude. As the grating strength is increased, a significant amount of intensity
is coupled into the cross-coupled components, robbing the desired fan-out beams of the
desired diffraction efficiency. In addition, the relative diffraction efficiencies observed in the
designated fan-out beams are no longer independent of the grating strength, as desired in
a fully independent weighted interconnection. Extensive modeling of N-to-N holographic
interconnections with collinear fan-in suggests that the throughput efficiency falls off
approximately as 1/N, which is potentially catastrophic for large interconnection networks. In a
later section, we will describe an alternative holographic recording approach that obviates
this 1/N loss.
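
The qualitative behavior described above can be reproduced with a much simpler model than the beam propagation method. The following sketch integrates idealized coupled-mode equations rather than performing the simulation cited in the text: it assumes lossless, perfectly Bragg-matched gratings with real weights, in which grating w[i][j] couples input direction j to output direction i, and it uses a hand-written fourth-order Runge-Kutta step. All function and parameter names are hypothetical.

```python
import math


def _axpy(vec, dvec, h):
    """Return vec + h * dvec, element by element."""
    return [v + h * d for v, d in zip(vec, dvec)]


def _derivs(x, y, w, kappa):
    """Coupled-mode derivatives for the x' (input side) and y' (output side) beams."""
    n_out, n_in = len(w), len(w[0])
    dx = [-1j * kappa * sum(w[i][j] * y[i] for i in range(n_out)) for j in range(n_in)]
    dy = [-1j * kappa * sum(w[i][j] * x[j] for j in range(n_in)) for i in range(n_out)]
    return dx, dy


def read_out(w, x_in, kappa, length=1.0, steps=2000):
    """Integrate through the medium with RK4; return the beam powers (x', y')."""
    x = [complex(v) for v in x_in]
    y = [0j] * len(w)
    h = length / steps
    for _ in range(steps):
        k1x, k1y = _derivs(x, y, w, kappa)
        k2x, k2y = _derivs(_axpy(x, k1x, h / 2), _axpy(y, k1y, h / 2), w, kappa)
        k3x, k3y = _derivs(_axpy(x, k2x, h / 2), _axpy(y, k2y, h / 2), w, kappa)
        k4x, k4y = _derivs(_axpy(x, k3x, h), _axpy(y, k3y, h), w, kappa)
        x = [a + h / 6 * (p + 2 * q + 2 * r + s)
             for a, p, q, r, s in zip(x, k1x, k2x, k3x, k4x)]
        y = [b + h / 6 * (p + 2 * q + 2 * r + s)
             for b, p, q, r, s in zip(y, k1y, k2y, k3y, k4y)]
    return [abs(a) ** 2 for a in x], [abs(b) ** 2 for b in y]


n = 4
w = [[1.0] * n for _ in range(n)]  # all 16 weights equal, as in the illustration
for strength in (0.1, 0.2, 0.3, math.pi / (2 * n)):
    p_x, p_y = read_out(w, [1, 0, 0, 0], kappa=strength)
    print(round(strength, 3), [round(p, 4) for p in p_x], round(sum(p_y), 4))
```

In this idealized model with equal weights, the total power in the four fan-out beams never exceeds 1/N of the input (here 1/4, reached at a grating strength of pi/(2N)), while the remainder stays in the zero order beam or leaks into the cross-coupled directions. Unequal weights can be tried by changing `w`.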


FIGURE 15.17 Illustration of the generation of crosstalk in holographic optical
interconnections due to beam degeneracy: diffraction efficiency as a function
of grating strength for the readout configuration of Figure 15.16. Shown
are the depletion of the zero order beam x'_1 and the rise of the desired output
beams y'_i, accompanied by a strong buildup of the cross-coupled beams x'_2 through x'_4.


The development of a viable photonic interconnection technology is based in no small
part on the availability of appropriate photosensitive recording materials [Psaltis,
1988; Smith, 1977; Gunter, 1988; Gunter, 1989]. Many interconnection demonstration
experiments have been performed in the laboratory on bleached photographic emulsions
and dichromated gelatin films, both of which are thick enough (10-30 μm) to exhibit
sufficient Bragg-like diffraction behavior to allow a limited degree of multiplexing to be
incorporated. Neither material, however, exhibits capacity for real time operation, which
is essential for the implementation of photonic neural networks with at least some degree
of synaptic plasticity. On the other hand, one principal advantage of photographic film
and dichromated gelatin is their essentially infinite read-write asymmetry, which is highly
desirable in many applications as described below.

By real time operation, we mean that the holographic interconnections can be programmed
(exposed) and used (read out) on roughly the same time scale (perhaps at kHz
frame rates), without the necessity of chemical development processes or the like. By
read-write asymmetry, we mean that the readout of a programmed interconnection should not
erase the stored weights at an accumulated readout exposure equal to that of the recording
exposure. Ideally, we would like to have the capability of exposing the holographic
interconnection to the recording beams with essentially instantaneous “development” of the
stored gratings, with the recording process characterized by very high sensitivity during
the “learning” process. At the same time, we would like to be able to initiate readout of
the stored interconnection pattern without altering the stored weights for a length of time
equal to the desired “computation” time. Although in many applications the learning and
computation times may differ by only an order of magnitude, in other cases it is desirable
to compute for very long times compared with the learning phase, and yet still maintain
the capacity for (slowly varying) weight updates.

The class of photosensitive recording materials that has been most extensively investigated
for photonic interconnection applications does in fact have the capacity for sensitive
holographic recording, is available in “thick” samples that allow for the formation of
Bragg-regime diffraction gratings, exhibits a high multiplexing capacity, and allows for the
inclusion of modest read-write asymmetries. This class is that of the so-called “photorefractive”
materials [Gunter, 1988; Gunter, 1989], which includes single crystals of semi-insulating
optical materials such as bismuth silicon oxide (Bi12SiO20), bismuth germanium oxide
(Bi12GeO20), lithium niobate (LiNbO3), strontium barium niobate (Sr_{1-x}Ba_{x}Nb2O6),
potassium niobate (KNbO3) and barium titanate (BaTiO3), as well as semi-insulating
semiconductors such as gallium arsenide (GaAs), indium phosphide (InP) and cadmium
telluride (CdTe). The use of the term “photorefractive” to describe these materials
exclusively is somewhat misleading, in that many other classes of materials are known
to undergo a refractive index change following illumination as well as those traditionally
included in the class described above. But at least the term is descriptive of the basic
phenomenon involved, as outlined below.

In  photorefractive  materials  such  as  bismuth  silicon  oxide,  exposure  to  an  interference 
pattern at an appropriate wavelength (characterized by significant photosensitivity)
generates free charge carriers (electrons or holes) liberated from deep traps. The number of
photogenerated carriers is in general proportional to the local intensity absorbed by the
crystal; as such, the photogenerated carrier population mimics the exposure pattern in
both amplitude and phase. The photogenerated carriers are free to diffuse to regions of
lower intensity, or they can be assisted out of the brightest regions by application of a bias
electric field to produce carrier drift. In either case, they tend to be retrapped, in turn
creating a space charge distribution that has the same spatial frequency as the interference
pattern. This space-variant space charge distribution produces a locally modulated
electric  field  with  the  same  spatial  frequency  (as  determined  by  the  grating  spacing  or 
grating  wavelength),  which  in  turn  induces  a  local  change  in  the  refractive  index  of  the 
photorefractive material through the linear (Pockels) or quadratic (Kerr) electrooptic
effect [Kaminow, 1974]. The refractive index grating can then be probed by a readout beam
to generate a diffracted beam, just as in the case of the pure phase gratings described
previously.

An excellent set of review articles on the physical properties and applications of
photorefractive materials has been assembled by Gunter and Huignard [Gunter, 1988; Gunter,
1989]. The state of the art is such that 1 cm^3 crystals of many of these materials have
been grown, and shown to exhibit a very high degree of optical quality. Exposure
sensitivities vary widely, but several crystals require of order 500 μJ/cm^3 for full exposure to
saturation (the highest grating strength that can be achieved in that particular crystal).
This corresponds to the absorption of about 50 mW/cm^2 of optical intensity throughout
1 cm^3 of material for an exposure period of 10 msec. The range of spatial frequencies that
can be supported in these materials ranges from a few lines/mm to over 2000 lines/mm.
Diffraction efficiencies close to 100% have been observed in several types of crystals, while
others saturate nearer to 10% for thicknesses of order 1 cm.
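
The exposure figures quoted above can be cross-checked with a short unit conversion. This is a minimal sketch that assumes a crystal of uniform thickness that absorbs the light evenly over its depth; the function and argument names are hypothetical.

```python
def required_intensity_mw_per_cm2(energy_density_uj_per_cm3, thickness_cm, exposure_s):
    """Absorbed intensity needed to reach saturation within the exposure time."""
    energy_per_area_uj = energy_density_uj_per_cm3 * thickness_cm  # uJ per cm^2 of aperture
    power_per_area_uw = energy_per_area_uj / exposure_s            # uJ/s is uW
    return power_per_area_uw * 1e-3                                # convert uW to mW


# 500 uJ/cm^3 absorbed throughout a 1 cm thick crystal in 10 msec
print(required_intensity_mw_per_cm2(500.0, 1.0, 10e-3))
```

For the values quoted in the text, the result is about 50 mW/cm^2, in agreement with the figure given.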

Optimization of photorefractive materials for interconnection device applications is under
way, including the development of growth processes for large photorefractive crystal
boules with a high degree of optical uniformity; the characterization of both unintentionally
incorporated impurities and intentionally incorporated dopants that alter the holographic
recording, readout, and storage characteristics; the use of applied d.c. and a.c. bias electric
fields to enhance the holographic recording sensitivity; the use of polarization effects
to enhance the reconstructed image signal-to-noise ratio; and the antireflection coating of
the (typically high index) front and rear surfaces to increase the diffraction efficiency and
avoid the presence of unwanted gratings due to multiple reflections [Karim, 1988; Karim,
1989a; Karim, 1989b]. In addition, the origin of electric field nonuniformities that occur
within photorefractive crystals during grating recording is under active investigation, and
several methods of eliminating the field collapse have been discovered [Herbulock, 1988].
Use  of  these  methods  increases  both  the  saturation  diffraction  efficiency  and  the  grating 
response  time,  resulting  in  more  efficient  interconnection  devices  that  operate  at  higher 
recording sensitivities.

Sources and Source Arrays


In  reviewing  a  large  fraction  of  the  journal  articles  published  over  the  past  decade 
on  optical  information  processing  and  computing,  including  the  most  recent  coverage  of 
photonic  implementations  of  neural  networks,  you  will  be  inspired  perhaps  by  the  cleverness 
of  a  particular  proposed  architecture,  or  intrigued  by  the  novel  features  of  a  particular 
device  structure.  But  you  will  also  be  amazed  at  the  apparent  lack  of  emphasis  on  certainly 
one  of  the  most  fundamental  components  in  any  proposed  photonic  computational  system: 
the  source  of  the  light!  This  oversight  may  be  caused  in  part  by  direct  analogy  to  the 
situation  in  VLSI  electronics,  in  which  it  is  a  bit  unglamorous  (and  probably  also  to  a 
certain  extent  unnecessary)  to  concentrate  on  the  battery  or  the  power  supply.  After 
all,  electrical  power  is  relatively  inexpensive,  widely  available,  well  characterized,  and 
reasonably  abundant.  At  peak  usage,  your  home  probably  uses  about  10  kW,  most  of 
which  is  dissipated  in  the  air  conditioner. 

However, the situation in photonic technology is quite different. Sources of coherent
optical radiation that can produce average output powers in the 10 kW range exist in
only a few laboratories, are very large (about 15 m^3), usually emit in the far infrared
(10.6 μm), and are far from inexpensive. Incoherent sources in the range of 100 - 1000 W
are available (xenon-mercury (Xe-Hg) gas discharge lamps, for example), but this type of
source is typically noisy (exhibits large intensity fluctuations), difficult to collimate, and
characterized by a very short lifetime (from the systems perspective). In addition, gas
discharge lamps are broadband sources, and as such usually require wavelength filtering
in order to provide compatibility with wavelength sensitive devices such as volume holographic
optical elements and spatial light modulators. A broadband source that has been
suitably filtered to allow readout of a typical volume holographic optical element (within
the allowable spectral bandwidth of the stored diffraction gratings) might generate only
about 10^-5 - 10^-6 of its total rated power in the wavelength region of interest. For the
1000 W Xe-Hg lamp, this results in only about 1 - 10 mW of quasi-monochromatic optical
power.

Coherent, monochromatic optical power can be provided by an array of different types
of laser sources [Milonni, 1988], including the argon-ion (Ar+) laser, the neodymium-YAG
(Nd-YAG) laser, the helium neon (He-Ne) laser, the helium cadmium (He-Cd) laser, dye
lasers, excimer lasers, and semiconductor laser diodes. Typical monochromatic (single laser
line) power outputs from the first two types range from about 500 mW to 25 W. Helium
neon and helium cadmium lasers are readily available as well as relatively inexpensive, but
have output powers that are typically in the range 1 - 5 mW, peaking out at about 50 mW.
Dye lasers are often optically pumped by argon-ion lasers, and hence exhibit power outputs
slightly lower than that of the pump laser. Excimer lasers are typically operated in the
pulsed mode of operation at repetition rates of 10 - 1000 pulses per second, and emit
average powers in the 10 - 100 W range. Finally, semiconductor laser diodes are available
with very long lifetimes at output powers of 1 - 20 mW, and much shorter lifetimes in the
100 mW - 1 W range.
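
The filtered-lamp estimate given above, and its comparison with the laser powers just listed, can be checked with a few lines of arithmetic. The sketch below only restates numbers quoted in the text; the function name and the dictionary layout are hypothetical.

```python
def filtered_power_mw(rated_power_w, usable_fraction):
    """Quasi-monochromatic power left after spectral filtering, in mW."""
    return rated_power_w * usable_fraction * 1e3


# A 1000 W Xe-Hg lamp with 10^-6 to 10^-5 of its power in the usable band
lamp_low = filtered_power_mw(1000.0, 1e-6)
lamp_high = filtered_power_mw(1000.0, 1e-5)
print(lamp_low, lamp_high)

# Power ranges quoted in the text, in mW (low, high)
laser_ranges_mw = {
    "argon-ion or Nd-YAG": (500.0, 25000.0),
    "He-Ne or He-Cd (typical)": (1.0, 5.0),
    "laser diode (long lifetime)": (1.0, 20.0),
}
for name, (low, high) in laser_ranges_mw.items():
    print(name, "at least matches the filtered lamp:", high >= lamp_high)
```

On these figures a filtered kilowatt-class lamp delivers roughly 1 - 10 mW, which is comparable to a single low power laser diode.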

Of  these  six  different  types  of  coherent  sources,  the  first  five  are  still  relatively  bulky 
(about  0.1  m3),  consume  considerable  electrical  power,  generate  significant  amounts  of 
heat  (many  must  be  water  cooled  to  ensure  stable  operation  and  practical  lifetimes),  and 
are very expensive (especially when compared with a comparable electronic power supply!).
Although these sources can be (and indeed are) employed in current systems-level
demonstrations, their collective liabilities do not augur well for their eventual incorporation
in commercially viable computational systems in general, and perhaps neural network
applications in particular. This leaves the last category, that of semiconductor diode lasers
(including, possibly, miniaturized diode-pumped Nd-YAG lasers), for further consideration.

Before  discussing  the  properties  of  semiconductor  diode  lasers  as  optical  power  sources 
any  further,  we  should  at  least  note  that  the  range  of  output  powers  available  from  these 
sources  (1  -  100  mW  for  single  element  devices)  is  rather  limited.  Taking  an  upper  bound 
(with  continued  research  and  development)  of  about  a  watt  per  device  gives  us  a  realistic 
estimate  of  the  amount  of  average  coherent  source  power  available  for  at  least  circuit  level 
implementation  of  photonic  neural  networks,  though  certainly  at  the  systems  level  phased 
arrays  of  stripe  laser  diodes  and/or  multiple  sources  could  conceivably  be  employed. 

Semiconductor diode lasers [Kressel, 1977; Casey, 1978a; Casey, 1978b] have been extensively
investigated and developed over the past two decades for a broad range of commercial
applications, including compact disk player recording and readout, fiber optical
communications systems [Jones, 1988], merchandise optical scanners, and laser printers.
The physical size of these lasers is small enough (about 0.3 x 1 x 5 mm) to fit in a standard
transistor (or IC) package, as long as external cooling is not required. Lasers with power
outputs of 1 - 10 mW are relatively inexpensive, costing a few tens of dollars in quantity
on  the  average.  Higher  output  power  lasers  are  considerably  more  expensive,  however,  as 
are  lasers  with  very  narrow  spectral  linewidths  (so-called  single  longitudinal  mode  lasers). 
For  the  higher  power  lasers  (as  well  as  for  the  intermediate  power  lasers  that  are  required 
to  maintain  a  high  degree  of  center  wavelength  accuracy),  external  cooling  (e.g.  by  means 
of  a  thermoelectric  cooler)  must  be  provided  in  order  to  maintain  thermal  stability  in  both 
wavelength  and  output  power. 

The  wavelength  ranges  spanned  by  semiconductor  diode  lasers  are  dictated  by  the 
direct bandgap materials used to fabricate the coherent light-emitting diode (semiconductor
p-n junction). Aluminum gallium arsenide/gallium arsenide (AlGaAs/GaAs) lasers
grown on single crystal gallium arsenide substrates emit at wavelengths in the range 780
to 900 nm, while lasers based in the quaternary indium gallium arsenide phosphide (InGaAsP)
compound semiconductor system (and grown on indium phosphide substrates)
emit at wavelengths further into the infrared (1.2 to 1.6 μm). The aluminum gallium
arsenide/gallium  arsenide  lasers  in  particular  are  nearly  wavelength  matched  to  the  peak 
sensitivity  of  both  silicon  and  gallium  arsenide  photodetectors,  as  might  be  employed  for 
photonic  switching  in  spatial  light  modulator  arrays,  or  for  detection  of  computed  results 
in  a  system  diagnostic  or  output  plane. 

Within these ranges, a typical multimode semiconductor diode laser has a spectral
bandwidth of 0.5 - 2 nm; a single longitudinal mode laser has a much narrower spectral
bandwidth of order 10^-4 nm (about 50 MHz centered at an optical frequency of 3.5 x
10^14 Hz). Both multimode and single longitudinal mode diode lasers can be used to
write and read holographic optical interconnection elements, as long as the coherence
length of the laser is larger than the thickness of the holographic recording medium. The
coherence length of a laser is essentially the maximum path difference over which two
beams derived from the same laser can maintain the stable phase relationship necessary to
exhibit an interference pattern. In applications requiring high multiplexing capacity within
the holographic interconnection medium (or significant path differences among beams that
must coherently interfere), the narrower linewidths of the single longitudinal mode lasers are
often preferable since their coherence lengths are several orders of magnitude longer. For
example, typical multimode semiconductor diode lasers operated above threshold exhibit
coherence lengths in the range 0.1 - 10 mm, while stabilized single longitudinal mode diode
lasers can have coherence lengths exceeding 1 m.
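
The linewidth and coherence length figures above can be related by a rough estimate. The sketch assumes the common order-of-magnitude relations (frequency width equal to c times the wavelength width divided by the wavelength squared, and coherence length approximately the wavelength squared divided by the wavelength width); the exact prefactor depends on the spectral line shape, and the function names are hypothetical.

```python
C = 2.998e8  # speed of light in vacuum, m/s


def linewidth_hz(wavelength_m, bandwidth_m):
    """Frequency width corresponding to a given wavelength bandwidth."""
    return C * bandwidth_m / wavelength_m ** 2


def coherence_length_m(wavelength_m, bandwidth_m):
    """Order-of-magnitude coherence length for a given wavelength bandwidth."""
    return wavelength_m ** 2 / bandwidth_m


center = C / 3.5e14  # wavelength for the optical frequency quoted in the text
print("center wavelength (nm):", center * 1e9)
print("single mode linewidth (MHz):", linewidth_hz(center, 1e-13) / 1e6)
print("single mode coherence length (m):", coherence_length_m(center, 1e-13))
for bandwidth_nm in (0.5, 2.0):
    length_mm = coherence_length_m(center, bandwidth_nm * 1e-9) * 1e3
    print("multimode,", bandwidth_nm, "nm bandwidth, coherence length (mm):", length_mm)
```

With these relations, a bandwidth of 10^-4 nm near 860 nm corresponds to a linewidth of a few tens of MHz and a coherence length of several meters, while bandwidths of 0.5 - 2 nm give coherence lengths of roughly a millimeter or less, consistent with the ranges quoted in the text.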

Employing a single, high intensity optical power source in a typical neural network application
carries with it a potential penalty: an inherent tradeoff between energy efficiency
on  the  one  hand,  and  the  need  for  array  generation  optics  on  the  other.  This  tradeoff 
arises  from  the  fact  that  most  optoelectronic  implementations  of  neuron  unit  arrays  have 
either  photodetectors  or  modulation  windows  (in  some  cases  both)  that  are  smaller  in  size 
than  each  individual  pixel,  as  was  shown  schematically  in  Figures  15.4  and  15.7.  The  ratio 
of  the  area  of  a  given  photosensitive  element  to  the  entire  pixel  area  is  referred  to  as  the 
fill  factor  of  the  pixel  (with  respect  to  that  particular  element).  Typical  fill  factors  for 
the  photodetectors  and  modulation  windows  may  range  from  less  than  0.1  in  the  case  of 
monolithic  integration  to  about  0.5  for  hybrid  integrated  devices.  Light  that  falls  outside 
the  appropriate  areas  within  a  given  pixel  will  at  best  contribute  to  the  overall  system 
throughput  loss,  and  at  worst  may  adversely  affect  the  function  of  adjacent  devices  that 
exhibit  photosensitivity. 
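
As a small worked illustration of the fill factor definition, the sketch below computes the fill factor from element and pixel areas and the fraction of light that is lost when a pixel is illuminated uniformly. The 10 um detector in a 30 um pixel is a hypothetical geometry chosen only for illustration; the 0.1 and 0.5 values are the typical figures quoted above.

```python
def fill_factor(element_area, pixel_area):
    """Ratio of a photosensitive element's area to the full pixel area."""
    if element_area <= 0 or pixel_area <= 0 or element_area > pixel_area:
        raise ValueError("need 0 < element_area <= pixel_area")
    return element_area / pixel_area


def lost_fraction_uniform(ff):
    """With uniform illumination of the pixel, light outside the element is lost."""
    return 1.0 - ff


# Hypothetical geometry: a 10 um x 10 um detector in a 30 um x 30 um pixel.
ff_example = fill_factor(10.0 * 10.0, 30.0 * 30.0)

for label, ff in (("monolithic", 0.1), ("hybrid", 0.5), ("example", ff_example)):
    print(f"{label}: fill factor {ff:.2f}, lost {lost_fraction_uniform(ff):.2f}")
```

With uniform illumination and a fill factor of 0.1, roughly 90% of the incident light misses the intended element, which is the motivation for the array generation optics discussed next.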

In  order  to  efficiently  channel  the  optical  illumination  to  the  correct  photosensitive 
regions,  we  need  to  (a)  expand  the  source  illumination  uniformly  to  fill  the  entire  aperture 
of  the  device  in  question  (a  spatial  light  modulator  or  volume  holographic  optical  element, 
for  example),  (b)  in  many  cases  collimate  (or  re-collimate)  the  light  source  to  produce  a 
planar  wavefront  with  a  beam  of  constant  width,  (c)  spatially  filter  the  beam  to  enhance 
its  uniformity  by  eliminating  significant  fixed-pattern  noise,  (d)  focus  the  light  within  each 
individual  pixel  to  a  size  compatible  with  the  relevant  photosensitive  area  (in  effect  thereby 
generating  a  two-dimensional  array  of  focused  beamlets),  and  (e)  align  the  resulting  array 
of  focused  beamlets  with  each  succeeding  device  in  the  optical  path. 


The procedures and optical elements required for beam expansion, collimation, and
spatial  filtering  are  well  understood  among  the  optical  community  for  the  case  in  which  the 
source  beam  is  initially  axially  symmetric,  as  is  typical  of  gas  and  excimer  laser  systems. 
In  typical  semiconductor  laser  diodes,  however,  the  planar  nature  of  the  light-emitting 
heterojunction  region  often  gives  rise  to  a  diffraction-induced  beam  divergence  parallel 
to  the  junction  of  3  -  10  degrees,  and  a  corresponding  beam  divergence  perpendicular  to 
the  junction  of  20  -  60  degrees.  Comparable  procedures  and  optical  elements  for  such 
anamorphic  (non-axially  symmetric)  beams  are  more  complex,  and  are  currently  under 
development.  Also  under  development  are  a  number  of  types  of  semiconductor  diode 
lasers  that  emit  approximately  axially  symmetric  beams  suitable  for  standard  collimation 
and  filtering  systems. 

The optical source array generation problem has received considerable attention recently,
due primarily to significant interest in optical interconnection systems. In one
promising  approach,  a  two-dimensional  array  of  computer  generated  and  photographically 
reduced  amplitude-encoded  Fresnel  zone  plates  has  been  used  to  form  an  8  x  8  grid  of 
microlenses  that  function  by  means  of  diffraction  (from  what  is,  practically  speaking,  a 
computer  generated  hologram  (CGH))  rather  than  refraction  [Marrakchi,  1990].  In  another 
well-developed approach, computer generated binary phase holograms (so-called Dammann
gratings [Dammann, 1971]) have been configured to form large grid patterns of regularly
spaced  illuminated  spots  with  predetermined  locations  and  fill  factors  [Morrison,  1989]. 
Using  this  latter  technique,  32  x  32  arrays  have  been  generated  with  both  high  throughput 
efficiencies  and  low  scattered  light  by  crossing  two  fabricated  1  x  32  grating  arrays.  In 
addition,  an  81  x  81  array  has  been  experimentally  demonstrated  by  using  two  pairs  of 
crossed 1 x 9 grating arrays in an optical arrangement that generates multiple images by
means  of  a  convolution  operation  [McCormick,  1989].  In  both  of  these  techniques,  all  of  the 
resulting  light  beamlets  are  mutually  coherent,  as  they  derive  from  the  same  source.  This 
mutual  coherence  has  an  impact  on  the  utilization  of  such  source  arrays  for  the  generation 
of  independent  holographic  interconnection  networks,  as  described  in  the  subsection  on 
“Photonic  Interconnections”  above. 
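
The spot counts quoted above follow from simple geometry, which the sketch below reproduces for idealized gratings. It is not a model of the optical systems in the cited work: the functions `spot_orders`, `crossed`, and `convolve` are hypothetical helpers, every spot is treated as a point, and the coarse array is assumed to have nine times the pitch of the fine array so that the convolved copies tile without overlap.

```python
from itertools import product


def spot_orders(n):
    """Order indices of an idealized 1 x n fan-out grating, centered on zero."""
    return [k - (n - 1) / 2 for k in range(n)]


def crossed(n, pitch):
    """Spot positions from two crossed 1 x n gratings: an n x n grid."""
    axis = [pitch * k for k in spot_orders(n)]
    return set(product(axis, axis))


def convolve(spots_a, spots_b):
    """Convolution of two spot arrays: a copy of one array at every spot of the other."""
    return {(xa + xb, ya + yb) for (xa, ya) in spots_a for (xb, yb) in spots_b}


grid_32 = crossed(32, pitch=1.0)   # two crossed 1 x 32 gratings
fine = crossed(9, pitch=1.0)       # first pair of crossed 1 x 9 gratings
coarse = crossed(9, pitch=9.0)     # second pair, assumed 9x larger pitch
grid_81 = convolve(fine, coarse)

print(len(grid_32), "spots from crossed 1 x 32 gratings")
print(len(grid_81), "spots from convolving two 9 x 9 arrays")
```

If the pitch ratio were not matched to the number of spots, copies would either overlap or leave gaps, so the regular 81 x 81 grid depends on that assumption.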

An interesting alternative to the generation of pixelated optical sources by modification
of the properties of a single source is that of direct fabrication of multiple source arrays.
One striking example is the recent successful fabrication of over one million independent
surface-emitting semiconductor diode lasers on a single gallium arsenide chip [Jewell, 1990].
Both cylindrical and square cross-section microlasers have been fabricated with diameters
and edge dimensions in the range 1 - 5 μm, with heights above the surface of the wafer of
about 5.5 μm as shown schematically in Figure 15.18. In the fabrication process employed,
the laser mirrors are arranged to generate laser emission through the 500 μm thick gallium
arsenide substrate, as shown in the Figure. In order to accomplish this without significant
absorption in the substrate, the active (lasing) medium is composed of InGaAs quantum
wells with GaAs barriers, giving rise to an emitted infrared wavelength (≈ 950 nm) that
lies in a region of substrate transparency.


FIGURE 15.18 Illustration of a surface-emitting laser diode source array
[after Jewell, 1990]. In this example, the individual semiconductor laser diodes
are isolated by chemically assisted ion beam etching techniques, must be individually
contacted, and emit through the GaAs substrate.


In  the  present  configuration,  the  lasers  are  essentially  optically  isolated,  and  hence  are 
not  designed  to  be  mutually  coherent  (phase  locked).  In  fact,  over  time  constants  typical 
of  holographic  recording  in  currently  available  photorefractive  crystals  (milliseconds),  it  is 
likely that such arrays are for all practical purposes mutually incoherent, due both to the
optical  isolation  as  well  as  to  process-induced  variations  in  device  parameters  that  alter  the 
wavelength  emitted  from  each  individual  laser.  Arrays  of  surface-emitting  semiconductor 
lasers  that  have  been  specifically  designed  to  have  uniformly  separated  wavelengths  have 
also  been  demonstrated  [Chang-Hasnain,  1990].  We  shall  return  to  this  characteristic 
in  a  succeeding  section  that  addresses  a  particular  strategy  for  photonic  neural  network 
implementation. 

At present, each laser within the array operates at a threshold voltage of about 10
volts at a threshold current of a few milliamperes, resulting in a power dissipation of 10
- 50 milliwatts per device at threshold, and higher for power outputs significantly above
threshold. In order to keep the overall power density within established limits (1 - 10
W/cm^2) and thus to keep from overheating the substrate (resulting in potentially deleterious
effects on wavelength stability and/or catastrophic failure), the lasers must either be
spaced  appropriately,  operated  in  a  pulsed  (on/off)  mode  at  less  than  unity  duty  cycle,  or 
temporally  multiplexed  (turning  on  only  a  few  lasers  at  a  time)  by  resorting  to  individual 
rather  than  parallel  addressing.  Given  the  current  rate  of  progress  in  the  development  of 
these and other types of surface-emitting laser arrays, it is reasonable to expect demonstration
of continuous operation of up to 10^4 microlasers per square centimeter within the
near  future. 
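
The thermal budget described in this paragraph reduces to simple arithmetic, sketched below using the per-device dissipation (10 - 50 mW) and power density limits (1 - 10 W/cm^2) quoted in the text. The packing density of 10^6 lasers per square centimeter used for the duty cycle estimate is an assumption for illustration, not a figure from the cited work.

```python
def max_cw_density(limit_w_per_cm2, device_power_w):
    """Largest number of continuously operated lasers per cm^2 within the limit."""
    return limit_w_per_cm2 / device_power_w


def max_duty_cycle(limit_w_per_cm2, device_power_w, density_per_cm2):
    """Largest duty cycle (at most 1) keeping average power density within the limit."""
    return min(1.0, limit_w_per_cm2 / (device_power_w * density_per_cm2))


# Best case: 10 W/cm^2 limit with 10 mW per device.
best_density = max_cw_density(10.0, 10e-3)
# Worst case: 1 W/cm^2 limit with 50 mW per device.
worst_density = max_cw_density(1.0, 50e-3)

# Per-device dissipation allowed at 10^4 lasers per cm^2 and a 10 W/cm^2 limit.
budget_w = 10.0 / 1e4

# Duty cycle needed if 10^6 lasers per cm^2 (assumed density) are pulsed together.
duty = max_duty_cycle(10.0, 10e-3, 1e6)

print("continuous density range per cm^2:", worst_density, "to", best_density)
print("power budget per device at 1e4 per cm^2 (mW):", budget_w * 1e3)
print("duty cycle at 1e6 per cm^2:", duty)
```

On these figures, continuous operation of 10^4 devices per square centimeter implies a dissipation of about 1 mW per device or less, roughly an order of magnitude below the threshold dissipation quoted above.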

It  should  be  noted  that  the  array  shown  in  Figure  15.18  is  not  currently  configured 
for  parallel  operation  of  all  of  the  sources  simultaneously,  which  would  require  electrical 
contact  to  the  tops  of  each  selectively  etched  microcavity.  This  feature  could  likely  be 
provided  by  an  additional  surface  passivation  and  metallization  step.  Matrix-addressable 
surface-emitting  laser  arrays  have  recently  been  fabricated  by  forming  columns  of  lasers 
separated  by  etched  isolation  grooves,  and  interconnected  across  the  grooves  by  striped  row 
contacts [Orenstein, 1990a]. Application of an appropriate bias voltage across a given pair
of  electrodes  (column  and  row)  activates  the  laser  diode  at  the  intersection,  allowing  for 
raster-scanned  operational  modes  as  well  as  fully  parallel  operation  [Von  Lehmen,  1990]. 
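
The addressing logic of such a matrix-addressable array can be sketched as follows. This is an idealized logical model only: electrode drive is treated as a boolean, and electrical details such as threshold behavior, crosstalk, and contact resistance are ignored. The function names are hypothetical.

```python
def active_lasers(row_bias, col_bias):
    """Return (row, col) for every laser whose row and column are both driven."""
    return [(r, c)
            for r, row_on in enumerate(row_bias) if row_on
            for c, col_on in enumerate(col_bias) if col_on]


def raster_scan(n_rows, n_cols):
    """Yield the active set for each step of a one-laser-at-a-time scan."""
    for r in range(n_rows):
        for c in range(n_cols):
            rows = [i == r for i in range(n_rows)]
            cols = [j == c for j in range(n_cols)]
            yield active_lasers(rows, cols)


# Drive row 1 and column 0 only: the laser at their intersection turns on.
single = active_lasers([False, True, False], [True, False, False])

# Drive all rows and all columns: fully parallel operation.
parallel = active_lasers([True] * 3, [True] * 3)

# Raster-scanned operation of a 2 x 2 array.
scan = list(raster_scan(2, 2))

print(single, len(parallel), scan)
```

One consequence visible in this model is that the set of simultaneously active lasers is always the product of a row selection and a column selection, so arbitrary two-dimensional patterns require time multiplexing.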

Other  currently  investigated  approaches  to  surface-emitting  laser  array  fabrication  use 
various  techniques  to  form  the  microlaser  cavities  within  the  planar  substrate  without  the 
need  for  deep  etched  isolation  grooves,  such  as  by  the  use  of  ion  implantation  to  form 
electrically insulating isolation layers between the laser cavities [Tai, 1989a; Orenstein,
1990b] or by the current confinement that results from photolithographic definition of
one of the two laser mirrors and its associated electrical contact [Tai, 1989b]. Fabrication
processes  that  yield  planar  or  quasi-planar  device  structures  allow  for  direct  parallel  contact 
if  desired  without  the  complications  of  depositing  contacts  on  vertical  sidewalls. 

Before leaving the subject of semiconductor laser diodes and surface-emitting laser arrays,
it is worthwhile to note a very useful feature of such devices: their capacity for high
bandwidth direct modulation. By this we mean that the output intensity of the semiconductor
laser source can be modulated (at full modulation depth, i.e. from well below the
threshold for lasing to peak output power) at frequencies up to a few gigahertz by direct
variation of the voltage applied across the device. This attribute can be used to advantage
in many neuro-optical implementation architectures by eliminating the need for mechanical
or electrooptical shutters, as well as by offering temporal multiplexing as an additional
degree of freedom for the systems designer.

One additional type of solid state device that is capable of both single source and source
array fabrication is the light emitting diode (LED). Closely related to the semiconductor
laser diode, the LED is also a p-n junction device that can be fabricated with considerably
less processing complexity by elimination of the high reflectivity mirrors that form the
semiconductor laser cavity. An additional advantage is the lack of a threshold for operation,
allowing the LED to emit over a much wider dynamic range of applied voltages.
One drawback of light emitting diodes is that they are relatively broadband (incoherent)
sources, and as such are not usable as sources for holographic recording applications (and
in many cases for readout of multiplexed holographic optical elements as well). In addition,
they are relatively inefficient emitters with typical electrical-to-optical conversion
efficiencies of a few percent. This feature tends to make LEDs rather power consumptive
for a given amount of usable output intensity.


Detectors  and  Detector  Arrays 

Detectors are optoelectronic components that act as photon-to-electron converters, in
that they transform incident optical intensity into electronic form, usually a voltage or a
current. Detectors therefore allow the optical representation of neuron unit outputs, for
example, to be converted into an electronic representation for further processing. As such,
they are important components for the photonic implementation of neural networks in at
least two functional areas: (a) as input transducers for the necessary optical detection
function of optically addressed spatial light modulators, and (b) as output transducers
for the translation of optically generated intermediate and final results to an appropriate
electronic format. After all, once you've gone to all of the trouble of learning and computing
with a neural network, it might prove worthwhile occasionally to actually get the answer
out and use it to initiate some other useful process!

In both of these functional areas, we can further categorize detectors as (a) single pixel
detectors, and (b) interconnected detector arrays. In the first category, we include both
single  element  detectors  that  have  one  optical  input  aperture  and  one  output  channel,  as 
well  as  the  single  pixel  detectors  employed  as  part  of  an  array  in  two-dimensional  spatial 
light  modulators.  This  latter  assignment  is  made  because  even  though  detectors  used  in 
spatial  light  modulators  are  perhaps  configured  in  an  array,  their  outputs  are  used  only 
within  one  or  at  most  a  few  local  pixels.  In  the  second  category,  we  include  arrays  of 
detector elements that are interconnected in such a way that the entire parallel (one-
or two-dimensional) array can be read out electronically through one or more output
channels.  An  example  of  a  detector  array  in  this  category  might  be  the  light  sensitive 
element  in  the  CCD  (charge-coupled-device)  camera,  now  commonplace  in  many  solid 
state  cameras  and  video  cassette  recorders. 

This  distinction  between  single  pixel  detectors  and  detector  arrays  is  important  because 
the  technologies  that  are  commonly  employed  in  these  two  cases  differ  in  a  number  of 
respects,  and  as  a  result  can  exhibit  wide  differences  in  performance  characteristics  such  as 
bandwidth,  sensitivity,  linearity,  and  dynamic  range.  In  the  case  of  single  pixel  detectors, 
for  example,  it  proves  easier  to  jointly  optimize  performance  parameters  because  of  the 
larger  number  of  degrees  of  freedom  available  to  the  device  designer  in  a  single  input,  single 
output  system.  The  detector  array  designer,  on  the  other  hand,  often  must  make  additional 
tradeoffs  dictated  by  the  nature  of  the  charge  storage  and  readout  process  employed  over 
the  full  set  of  integrated  pixels. 

In  the  context  of  photonic  neural  network  implementations,  single  pixel  detectors  have 
two  primary  functions.  The  first  is  to  act  as  optical  signal  to  electronic  signal  converters 
within optically addressed spatial light modulators, to translate a pixel's worth of incident
light intensity (representing, for example, the weighted sum of signals from the output of
a plane of neuron units) into a voltage or current. The resultant electronic signal can then
be processed by local intrapixel circuitry to produce the desired neural threshold function
for subsequent optical encoding (modulation). This process could be accomplished
either onboard a monolithic or hybrid integrated optically addressed spatial light modulator
(OASLM), or on a separate detector chip that interfaces with an electrically addressed
spatial light modulator (EASLM). In this latter case, the detector will most likely fall
under the detector array category discussed further below, since a parallel-to-serial conversion
is typically required to extract the array of data (e.g., an image) from the detector
chip (followed by a serial-to-parallel conversion to load the signal into the EASLM). It
should  be  noted  that  even  in  the  case  of  monolithic  spatial  light  modulators  that  do  not 
feature  discrete  detectors,  electronic  control  circuitry,  and  modulators,  converting  a  two- 
dimensional  optical  input  distribution  into  a  modified  two-dimensional  output  distribution 
necessarily  involves  a  local  detection  function,  even  if  it  is  not  particularly  easy  to  separate 
the  detection  process  from  the  modulation  process. 

The second important single pixel detector function is to provide for single point monitoring
functions within the system, such as the output power from a given laser source,
the  average  power  emitted  from  a  laser  source  array,  or  a  particular  system  output  that 
activates  a  desired  process  or  function  (for  example,  the  identification  of  a  specific  defect 
pattern  on  a  manufactured  part  within  the  input  image  field  of  a  neural  image  processor, 
that  in  turn  results  in  rejection  of  the  part). 

Perhaps  the  simplest  type  of  detection  element  that  can  be  incorporated  in  a  single 
pixel  is  the  photoconductor,  which  typically  consists  of  a  thin  film  of  material  that  alters 
its  resistance  to  electrical  current  in  response  to  the  intensity  of  incident  illumination. 
The  most  commonly  used  single  pixel  photodetectors,  however,  are  based  in  some  way  or 
other  on  the  semiconductor  p-n  junction  diode.  Under  reverse  bias  in  a  p-n  junction  diode, 
photocarriers created by light absorbed within the region of the junction between n-type
and  p-type  semiconductor  layers  are  swept  out  of  the  junction  region  by  the  internal  electric 
field  across  the  junction,  and  collected  in  the  external  circuit.  If  the  internal  electric  field 
is  high  enough,  each  photocarrier  can  acquire  enough  energy  during  sweepout  to  generate 
an  avalanche  of  additional  carriers,  leading  to  significant  gain  in  the  class  of  so-called 
avalanche photodiodes.

The inclusion of an intrinsic (undoped or compensated) layer of semiconductor material
between the n-type and p-type layers allows for a significant reduction in the junction
capacitance of the device, with a corresponding improvement in signal bandwidth. Such
devices are commonly referred to as p-i-n photodiodes, packaged versions of which are
commercially available for a wide variety of photosensor functions. Typical p-i-n photodiodes
exhibit risetimes of a few nanoseconds, are linear in output over seven orders of
magnitude of input intensity, and are sensitive to very low light intensity levels. For silicon
p-i-n  photodiodes,  sensitivities  of  about  0.4  milliamperes  of  output  current  per  milliwatt  of 
optical  input  power  at  a  wavelength  of  830  nm  are  common,  which  represents  a  conversion 
efficiency  from  photons  to  electrons  of  approximately  60%. 
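
The quoted conversion efficiency can be checked from the sensitivity figure using the standard relation between responsivity R (amperes per watt) and quantum efficiency, eta = R h c / (q lambda). The sketch below uses rounded values of the physical constants; the function name is a hypothetical helper.

```python
H = 6.626e-34   # Planck constant, J s
C = 2.998e8     # speed of light in vacuum, m/s
Q = 1.602e-19   # elementary charge, C


def quantum_efficiency(responsivity_a_per_w, wavelength_nm):
    """Electrons collected per incident photon for a given responsivity."""
    photon_energy_j = H * C / (wavelength_nm * 1e-9)
    return responsivity_a_per_w * photon_energy_j / Q


# 0.4 mA per mW is 0.4 A/W, at a wavelength of 830 nm (values from the text).
eta = quantum_efficiency(0.4, 830.0)
print(f"quantum efficiency: {eta:.3f}")
```

The result is close to 0.6, in agreement with the approximately 60% figure stated above.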

Phototransistors are light sensitive devices that exhibit current gain in exactly the same
manner as a transistor, with the exception that the controlling base current is injected optically
rather than through the base lead. In fact, most VLSI transistors (both bipolar and
MOS) are photosensitive (though perhaps not optimized for the photodetector role), and
must be protected from stray light in order not to compromise their performance characteristics.
The principal advantage of a phototransistor is its inherent current gain of order 100
to 1000, which often makes the interface of the photodetector to following circuitry more
straightforward. In cases requiring exceptionally high gain in the front (photodetection)
end, two transistors can be paired as shown in Figure 15.19 so that one acts as a phototransistor,
and the other as a current amplifier. Such a two transistor combination has achieved
widespread use, and is referred to as a photo-Darlington pair [Sze, 1981b]. The tradeoffs
for increased gain in both of these cases are risetime (which translates directly into signal
bandwidth) and area required for integration. Typical risetimes for phototransistors are
almost three orders of magnitude longer (a few microseconds) than those characteristic
of p-i-n photodiodes. Photo-Darlingtons are yet another factor of ten or so slower in response
time. Optimized phototransistors and photo-Darlingtons require relatively large
collector-base junctions in order to provide an appropriately sized photosensitive region
that can be accessed by optical imaging techniques.


FIGURE 15.19 Schematic diagram of a photo-Darlington pair utilized as a
high gain detector/amplifier combination.


In many if not most cases, the type of photodetector chosen for use as a single pixel
detector in a spatial light modulator application depends on its integrability with associated
control electronics and modulation elements. This, in turn, depends on whether the
particular spatial light modulator in question is monolithically or hybrid integrated, as
discussed in the section on photonic switching above, and on which semiconductor substrate
the photodetection element itself is to be fabricated. In some cases, the desire for
integration  of  a  high  density  of  neuron  units  may  place  strict  bounds  on  the  area  allocated 
to  each  separate  function  in  general,  and  on  the  photodetection  and  requisite  amplification 
function  in  particular. 

In  traditional  applications  of  photodetector  technology,  for  example  in  spectroscopy 
and  optical  metrology,  linearity  of  response  (output  voltage  or  current  as  a  function  of  the 
input  intensity)  is  prized,  as  is  a  wide  dynamic  range  over  which  linearity  is  assured.  In 
neural  network  applications,  however,  linearity  is  typically  less  of  an  issue.  In  fact,  it  is 
often  convenient  to  use  the  inherent  nonlinearity  of  the  input-output  characteristic  of  a 
particular  photodetector  device  to  generate  part  or  all  of  the  nonlinearity  required  of  the 
overall  neural  unit  function.  This  can  result  in  a  lower  overall  expenditure  of  real  estate 
for  each  neuron  unit,  increasing  the  neuron  array  density,  as  well  as  in  a  reduction  of 
circuit  complexity  within  each  pixel.  One  such  example  is  the  output  current  saturation 
characteristic  of  phototransistors  at  high  input  intensities,  which  can  be  used  to  emulate 
the  upper  saturation  regime  of  the  sigmoidal  neuron  response  function. 
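
To illustrate how a saturating detector characteristic can supply the upper part of a sigmoidal response, the sketch below uses a toy model in which the output current is linear at low input power and approaches a saturation current at high input power. The tanh form, the parameter names `gain` and `i_sat`, and their values are assumptions chosen for illustration; they are not a physical model of any particular phototransistor.

```python
import math


def phototransistor_current(optical_power, gain=100.0, i_sat=1.0):
    """Toy saturating response: about gain * power when small, i_sat when large."""
    return i_sat * math.tanh(gain * optical_power / i_sat)


# Sweep the input power (arbitrary units) from the linear into the saturated regime.
powers = [0.0, 0.001, 0.005, 0.01, 0.02, 0.05, 0.1]
currents = [phototransistor_current(p) for p in powers]

for p, i in zip(powers, currents):
    print(f"power {p:6.3f} -> current {i:.4f}")
```

In this toy model the detector alone provides the compressive upper regime; the lower threshold regime of a full sigmoid would still have to come from other circuitry within the pixel.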

Detector  arrays  are  employed  whenever  the  intensity  distribution  of  a  one-  or  two- 
dimensional  image  field  requires  conversion  to  electronic  form  for  interface  with  succeeding 
computational or output stages of the system. In a very real sense, a two-dimensional
detector array is nothing more than the business end of an optoelectronic camera that
can be positioned anywhere within the optical system that the local intensity distribution
represents a desired result. In fact, low reflectivity beamsplitters can be used to merely
"sample" the local intensity distribution of a given beam of light, allowing most of the
incident  light  to  propagate  in  a  further  computational  arm  of  the  optical  train  for  use 
elsewhere. 

Detector  arrays  are  inherently  different  in  at  least  one  key  respect  from  the  single  pixel 
photodetectors  (as  well  as  arrays  of  photodetectors  used  in  optically  addressed  spatial  light 
modulators)  discussed  previously:  the  need  to  provide  for  some  form  of  output  channel 
multiplexing,  in  order  to  avoid  the  requirement  for  a  one-to-one  correspondence  between 
pixels  in  the  array  and  output  pins.  For  example,  in  a  1000  x  1000  element  detector 
array,  fully  parallel  readout  requires  one  million  output  channels  or  pinouts.  As  a  result, 
detector  arrays  are  usually  configured  to  perform  some  form  of  parallel-to-serial  conversion 
of  the  data  into  a  single  high  bandwidth  serial  channel  prior  to  the  readout  of  each  frame 
(though multiple output channels can also be used). This can either add significant circuit
complexity to the area surrounding each pixel in order to accommodate the parallel-to-
serial  conversion  and  interpixel  communication  function,  or  be  directly  incorporated  into 
the  design  of  the  photodetection  elements  themselves,  as  in  the  case  of  the  CCD  arrays 
discussed  below. 
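
The multiplexing tradeoff can be made concrete with a short calculation. The sketch below compares the number of output channels needed for fully parallel readout with the pixel rate that each channel must carry when the array is read out serially. The 30 frames per second rate and the eight-channel case are assumptions for illustration only.

```python
def parallel_output_channels(rows, cols):
    """Output channels needed when every pixel has its own pinout."""
    return rows * cols


def pixel_rate_per_channel(rows, cols, frame_rate_hz, channels=1):
    """Pixels per second that each output channel must carry."""
    return rows * cols * frame_rate_hz / channels


pins = parallel_output_channels(1000, 1000)
rate_single = pixel_rate_per_channel(1000, 1000, 30.0)
rate_eight = pixel_rate_per_channel(1000, 1000, 30.0, channels=8)

print("fully parallel readout channels:", pins)
print("single serial channel rate (pixels/s):", rate_single)
print("rate per channel with 8 channels (pixels/s):", rate_eight)
```

Serial readout thus trades one million pinouts for a single channel whose bandwidth scales with the product of array size and frame rate.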

The  state  of  the  art  of  detector  arrays  has  advanced  tremendously  even  over  the  past 
decade,  to  the  point  where  solid  state  detector  arrays  with  quite  spectacular  performance 
are used everywhere from astronomical applications (as detectors for even the largest telescopes),
to earth observation satellites (infrared focal plane arrays), to photomicroscopy (in
place  of  the  traditional  film-based  photographic  camera),  to  consumer  products  (electronic 
still  photography  and  video  cameras). 

One of the most successful and generally available types of solid state detector array
is the charge-coupled-device (or CCD) array [Sze, 1981c; Optical Engineering, 1987a;
Optical Engineering, 1987b]. In this technology, usually based on MOS fabrication techniques
in silicon (but adaptable to compound semiconductor substrates as well), incident
illumination  within  a  given  pixel  causes  the  accumulation  of  photogenerated  charge  in  an 
electrostatic  potential  well  formed  by  the  application  of  bias  voltages  on  a  set  of  electrodes, 
as  shown  schematically  in  Figure  15.20.  In  operation,  the  CCD  array  is  illuminated  for 
a given exposure time (slightly less than one full frame interval), during which time the
charge generated by the incident illumination is integrated within each primary well. Subsequently,
appropriate voltages are applied by means of multi-phased electrode structures
to  spill  the  accumulated  charge  packet  into  the  neighboring  well,  while  simultaneously 
moving  the  charges  in  the  neighboring  well  to  its  neighboring  well,  and  so  on  throughout 
the  array. 


FIGURE 15.20 Schematic diagram of a charge coupled device (CCD) photodetector
array fabricated on a silicon substrate. Electrostatic potential wells
are created by application of appropriate voltages to the three phase bias electrode
structure, with electrical isolation provided by the gate oxide layer. Light
incident through the transparent electrodes creates stored charge that can be
transferred to an output signal terminal by proper sequential phasing of the
bias voltages (P1 - P3).


The overall operation resembles the function of an array of one-dimensional shift registers.
At one edge of the structure, the charge packets from each row are collected into a
single  column  that  is  read  out  by  a  very  high  speed  shift  register  (a  linear,  usually  buried 
channel  CCD  array)  to  form  the  single  output  channel.  Full  readout  of  the  array  must 
occur  before  the  next  frame  is  exposed  (except  in  specifically  designed  cases  such  as  the 
time-delay-and-integrate  or  TDI  mode  of  operation,  in  which  only  one  shift  is  interposed 
between  successive  exposures). 
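
The shift-register analogy can be made concrete with a minimal simulation. The sketch below treats each row of wells as a list of charge packets, shifts every row one well toward the output edge per step, and clocks the resulting edge column out through a serial register. It assumes ideal (lossless) transfer and ignores the multi-phase clocking details; the function name `read_out_frame` is hypothetical.

```python
def read_out_frame(wells):
    """Serially read a 2-D list of charge packets through one output channel.

    Each step moves the packet nearest the output edge of every row into a
    serial register, shifts the remaining packets one well toward that edge,
    and then clocks the serial register out to the output stream.
    """
    rows = [list(row) for row in wells]   # copy so the input is not modified
    n_cols = len(rows[0])
    stream = []
    for _ in range(n_cols):
        # Packets at the output edge of each row form a single column.
        serial_register = [row.pop() for row in rows]
        # The well farthest from the output edge is now empty.
        for row in rows:
            row.insert(0, 0)
        # The fast serial register is read out before the next row shift.
        stream.extend(serial_register)
    return stream


frame = [[1, 2, 3],
         [4, 5, 6]]
stream = read_out_frame(frame)
print(stream)
```

The output stream contains every packet exactly once, ordered column by column starting from the output edge, which is the parallel-to-serial conversion described above.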

One-dimensional  arrays  of  CCD  elements  have  been  successfully  fabricated  in  sizes 
of  1  x  2048,  while  special  purpose  two-dimensional  CCD  imaging  arrays  2048  x  2048  in 
size are commercially available [Blouke, 1987]. This represents a parallel detector with
4,194,304  individual  pixels!  In  one  particular  2048  x  2048  CCD  array,  the  imaging  area  is 
5.5  x  5.5  centimeters,  with  a  pixel  size  of  27  x  27  microns.  This  array  exhibited  a  dark 
(unilluminated)  noise  buildup  in  each  pixel  of  only  6  to  12  electrons  when  read  out  at  a 
rate of 50,000 pixels per second, which allows for detection of extremely low level signals
with excellent signal-to-noise ratio. Given a well capacity of about 700,000 electrons, this
very low noise figure suggests a dynamic range in excess of 70,000, nearly five orders of
magnitude! For well charge densities less than 200,000 electrons per pixel, the linearity is better
than  0.5%  over  this  portion  of  the  full  dynamic  range.  Finally,  this  array  exhibited  an 
extraordinarily  high  charge  transfer  efficiency  of  0.999992,  representing  the  fraction  of 
charge  within  a  given  pixel  that  is  routinely  transferred  to  an  adjacent  pixel  without  loss. 
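
The figures quoted for this array can be cross-checked with a few lines of arithmetic. The sketch below uses only the numbers given in the text, except for the worst-case transfer count, which is an assumption: it supposes that the pixel farthest from the output undergoes one transfer per pixel along a full row and along the full serial register (2048 + 2048), whereas the true count depends on the number of clock phases and the readout layout.

```python
N = 2048                      # pixels per side
PIXEL_PITCH_UM = 27.0         # pixel size in microns
READ_RATE = 50_000            # pixels per second
WELL_CAPACITY = 700_000       # electrons
NOISE_ELECTRONS = (6, 12)     # dark noise range per pixel
CTE = 0.999992                # charge transfer efficiency per transfer

pixels = N * N
side_cm = N * PIXEL_PITCH_UM * 1e-4
readout_time_s = pixels / READ_RATE
dynamic_range = [WELL_CAPACITY / n for n in NOISE_ELECTRONS]

# Assumed worst case: one transfer per pixel across a row plus the serial register.
worst_case_transfers = N + N
retained_fraction = CTE ** worst_case_transfers

print("pixels:", pixels)
print("side length (cm):", round(side_cm, 2))
print("full-frame readout time (s):", round(readout_time_s, 1))
print("dynamic range:", [round(d) for d in dynamic_range])
print("charge retained after worst-case transfers:", round(retained_fraction, 4))
```

The dynamic range of 70,000 quoted above corresponds to a noise level of about 10 electrons, within the stated 6 to 12 electron range. At the quoted readout rate a full frame takes on the order of a minute and a half, which indicates that this low noise figure applies to slow-scan operation.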

The  integration  of  large  scale  detector  arrays  by  means  of  VLSI  techniques  provides  the 
prospect  of  special  purpose  arrays  that  perform  part  of  the  computational  function  within 
the  confines  of  the  array.  One  example  of  such  special  purpose  chips  is  the  incorporation 
in  a  CCD  array  of  charge-coupled  analog  circuitry  to  perform  arithmetic  operations  such 
as addition, subtraction, and magnitude comparison [Fossum, 1987]. Such an array could
allow for detection of parallel differential outputs, with both positive (excitatory) and negative
(inhibitory) weighted sums as dual optical inputs in a (positive definite) intensity
representation. 
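
Because optical intensity cannot be negative, one way to handle signed weights is to carry the excitatory and inhibitory weighted sums in two separate non-negative channels and subtract them after detection. The sketch below illustrates that bookkeeping; it is a numerical illustration rather than a model of the cited CCD circuitry, and the function names are hypothetical.

```python
def split_weights(weights):
    """Split signed weights into non-negative excitatory and inhibitory parts."""
    excitatory = [max(w, 0.0) for w in weights]
    inhibitory = [max(-w, 0.0) for w in weights]
    return excitatory, inhibitory


def differential_sum(weights, inputs):
    """Return the two non-negative channel sums and their difference."""
    w_plus, w_minus = split_weights(weights)
    s_plus = sum(w * x for w, x in zip(w_plus, inputs))
    s_minus = sum(w * x for w, x in zip(w_minus, inputs))
    return s_plus, s_minus, s_plus - s_minus


weights = [0.5, -0.25, 1.0]
inputs = [1.0, 2.0, 0.5]      # non-negative intensities
s_plus, s_minus, net = differential_sum(weights, inputs)

print("excitatory sum:", s_plus)
print("inhibitory sum:", s_minus)
print("net weighted sum:", net)
```

Each channel sum is non-negative and can therefore be represented as an intensity, while the subtraction, which can produce a negative result, is performed electronically.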


ARCHITECTURAL  CONSIDERATIONS  FOR 
PHOTONIC  NEURAL  NETWORK  IMPLEMENTATIONS 

We  now  turn  our  attention  to  the  use  of  the  photonic  components  and  fundamental 
principles described above in the implementation of highly parallel neural network architectures.
The focus in this section is on a general framework that emphasizes characteristics
common to different approaches to photonic and optical neural network implementations,
as well as on illuminating some of the key fundamental differences among the various
implementation approaches. A review of recent and ongoing research in photonic and optical
neural network implementations is beyond the scope of this chapter; sources of such
information  can  be  found  in  the  Suggested  Further  Reading  section  at  the  end  of  this 
chapter. 

Photonic neural network implementations can be adaptive or non-adaptive, can represent
the signal using different physical quantities, and can be built using one-dimensional
(1-D) or two-dimensional (2-D) arrays of neuron units with two-dimensional or
three-dimensional (3-D) interconnection elements. These issues, in addition to other features
that are desirable in any photonic implementation of a neural network, are discussed in
this section. Throughout, one should keep in mind the distinctions that exist among
systems with fixed interconnections, programmable systems, and truly adaptive systems.
We will initially concentrate on the implementation of a single layer of a network, and
subsequently show how this generalizes to multiple layers.

The  computation  process  of  any  one  layer  of  a  neural  network  can  be  represented  by: 


$$ y_i = f\left[ \sum_j w_{ij} x_j \right] \qquad (23) $$

in which neuron unit $j$ is situated at the input to the layer of interconnections, neuron unit
$i$ is situated at the output of the layer of interconnections, $y_i$ is the output of neuron unit
$i$, $x_j$ is the output of neuron unit $j$, $w_{ij}$ is the weight associated with the interconnection
between them, and the function $f$ represents the neuron unit nonlinearity. The term inside
the brackets, the activation potential, will be denoted by $p_i$. Note that the term in brackets
is a matrix-vector product between an interconnection weight matrix and an input vector.
The function $f$ then operates independently on each element of the resulting vector; this is
called a point nonlinearity, and as such lends itself to implementation with a spatial light
modulator (SLM).
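
The single-layer computation described above can be sketched numerically as a matrix-vector product followed by a point nonlinearity. This is a minimal illustration only: the function name `layer_output`, the choice of `tanh` for $f$, and the sample weights are assumptions made for the example and do not come from the text.

```python
import math

def layer_output(weights, x, f=math.tanh):
    """Return (p, y) with p_i = sum_j w_ij * x_j and y_i = f(p_i).

    weights[i][j] is the weight from input unit j to output unit i.
    f is applied independently to each element (a point nonlinearity);
    tanh is an assumed choice, not one specified in the text.
    """
    p = [sum(w_ij * x_j for w_ij, x_j in zip(row, x)) for row in weights]
    y = [f(p_i) for p_i in p]
    return p, y

W = [[0.5, -1.0, 0.25],
     [1.0,  0.0, -0.5]]
x_in = [1.0, 0.5, 2.0]
p_out, y_out = layer_output(W, x_in)
print(p_out, y_out)
```

The matrix-vector product is the part performed by the optical interconnection; the elementwise call to `f` is the part assigned to the SLM or to electronics.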

Most  current  learning  algorithms  fall  into  one  of  a  small  number  of  classes.  For  example, 
one  such  class  can  be  specified  by: 


$$ \Delta w_{ij} = \alpha \delta_i x_j - \beta w_{ij} \qquad (24) $$

in which $\Delta w_{ij} = w_{ij}(k+1) - w_{ij}(k)$ is the weight update, $k$ represents the iteration index,
$\alpha$ is the learning gain constant, and $\beta$ is a decay constant that is included primarily for
hardware convenience; $\beta$ can be set to 0 when so desired. Suitable choices of $\delta_i$ give
different learning algorithms, such as Hebbian, Widrow-Hoff, single-layer least mean
squares (LMS), and for the case of multilayer networks, backward error propagation. (For
example, an optical architecture that potentially implements backward error propagation
in a multilayer neural network is described by Wagner and Psaltis [Wagner, 1987].) In this
chapter we will restrict our attention to this particular class of algorithms for illustrative
purposes. Although other classes of learning algorithms can likely also be implemented
using photonic hardware, research to date has focused primarily on the class represented
by Equation (24). An important aspect of Equation (24) for implementation is the outer
product between the training vector $\delta$ and the input vector $x$ for the weight matrix update.
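
The outer-product structure of this class of update rules can be made concrete with a short sketch. The function name `weight_update` and the numerical values are illustrative assumptions; the rule itself is the one stated in Equation (24), with the Hebbian choice of training term (training term equal to the neuron output) used as the example.

```python
def weight_update(W, delta, x, alpha=0.1, beta=0.0):
    """Return W(k+1) = W(k) + alpha * outer(delta, x) - beta * W(k).

    W[i][j] is the weight from input unit j to output unit i,
    delta[i] is the training term for output unit i, and x[j] is the input.
    """
    return [[w_ij + alpha * d_i * x_j - beta * w_ij
             for w_ij, x_j in zip(row, x)]
            for row, d_i in zip(W, delta)]

W0 = [[0.0, 0.0],
      [1.0, -1.0]]
x_vec = [1.0, 2.0]
y_vec = [0.5, -1.0]

# Hebbian learning: the training term is the neuron unit output itself.
W1 = weight_update(W0, delta=y_vec, x=x_vec, alpha=0.1, beta=0.0)
print(W1)
```

Every element of the update depends only on one training term and one input, which is why all weights can in principle be updated in parallel.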

An  example  of  a  photonic  neural  system  is  shown  in  block  diagram  form  in  Figure 
15.21.  This  system  utilizes  a  1-D  array  of  neuron  units  at  the  input  and  output,  and 
a 2-D interconnection mask. Each pixel in the input is expanded optically (using cylindrical
lenses) and illuminates the corresponding row of the interconnection mask. The
mask  stores  the  analog  weights,  and  provides  a  pointwise  multiplication  before  the  beam 
is  contracted  so  that  one  column  from  the  mask  is  incident  onto  one  corresponding  output 
pixel.  The  optical  system  in  effect  provides  a  fully  parallel  analog  optical  matrix-vector 
multiplication  as  represented  by  the  bracketed  term  in  Equation  (23),  performed  over  all  i. 
Threshold  functions  and  feedback  connections  are  provided  by  means  of  either  photonics 
or  electronics.  The  first  experimental  demonstration  of  such  a  system  applied  to  neural 
network  implementations  used  an  array  of  light  emitting  diodes  (LEDs)  as  inputs  to,  and  a 
linear  detector  array  as  the  output  from,  the  optical  interconnection  [Psaltis,  1985;  Farhat, 
1985].  This  particular  system  utilized  electronics  to  provide  the  threshold  functions  and 
feedback  connections. 
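
The three optical steps just described (fan-out of each input pixel along a row of the mask, pointwise multiplication by the mask, and fan-in of each column onto one detector pixel) can be mimicked in a few lines. This is a sketch under simplifying assumptions: light is treated as non-negative intensity, the optics are lossless, and the name `optical_matvec` and the mask values are hypothetical.

```python
def optical_matvec(mask, x):
    """Simulate the 1-D to 1-D optical matrix-vector multiplier.

    mask[j][i] is the transmittance encoding the weight from input j to
    output i (row j is illuminated by input pixel j); x[j] is an intensity.
    """
    n_out = len(mask[0])
    # Fan-out: input pixel j is spread uniformly along row j of the mask.
    expanded = [[x_j] * n_out for x_j in x]
    # Pointwise multiplication by the mask transmittance.
    transmitted = [[e * m for e, m in zip(e_row, m_row)]
                   for e_row, m_row in zip(expanded, mask)]
    # Fan-in: each column of the mask is collected onto one output pixel.
    return [sum(column) for column in zip(*transmitted)]

mask = [[0.5, 1.0],
        [0.25, 0.0],
        [1.0, 0.5]]
intensities = [2.0, 4.0, 1.0]
detected = optical_matvec(mask, intensities)
print(detected)
```

The thresholding and feedback stages, which the text assigns to photonics or electronics, are deliberately left out of this sketch.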


FIGURE  15.21  Block  diagram  of  a  1-D  to  1-D  photonic  neural  network, 
in which a one-dimensional neuron array is fully interconnected to a one-dimensional
detector array by means of a two-dimensional interconnection
mask. 


It should be noted that many variants of Figure 15.21 are possible; some of them are
more compact than others, though all of them share essentially the same basic characteristics.
The interconnection mask can be fixed (e.g., photographic film) or variable (e.g., an
SLM). In the latter case the SLM can be electronically or optically addressed. Electronic
addressing is appropriate for straightforward interfacing to an electronic machine that
supplies the (updated) interconnection weights, whereas for a maximum adaptation rate
an optical addressing technique would ultimately be optimal. Currently available SLMs
with large numbers of pixels tend to be slow (500 x 500 analog pixels with 1 to 100 ms
frame times) [Tanguay, 1985]; much faster technologies are being developed for future use
[see, for example, Lentine, 1988; Lentine, 1991; McCormick, 1989b]. Such a system, with
1-D inputs, 1-D outputs, and 2-D interconnections, will likely scale up to 100 to 1000
fully connected neuron units.

A photonic system that can implement larger numbers of neuron units and interconnections
is shown in Figure 15.22. All neuron unit planes are now 2-D arrays, and the
interconnection medium is a 3-D structure, implemented in a volume holographic material.
In effect, there is a separate volume grating connecting each input neuron unit $j$ to
each output neuron unit $i$. The diffraction efficiency of each grating is proportional to the
weight, $w_{ij}$, of the corresponding interconnection. Note that each such grating is analogous
to a beamsplitter, as discussed in the previous section, with the primary difference that the
volume gratings are direction (and wavelength) selective. Thus, beams incident on such
a “beamsplitter” at other than the correct angle are not affected by the presence of the
holographic beamsplitter. Properly recorded, then, the grating $w_{ij}$ is situated in angular
orientation and grating period so that it affects only the inputs at the angle corresponding
to $x_j$, and will direct the corresponding output $w_{ij} x_j$ to the correct summation node $p_i$.
The achievable numbers of neuron units and interconnections are currently subjects of
considerable debate, but would likely be $10^4$ to $10^6$ neuron units per plane and on the order
of $10^{10}$ independent interconnections with weights, assuming continued research unveils no
impassable boundaries.


FIGURE 15.22 Block diagram of a 2-D to 2-D photonic neural network,
in which a two-dimensional neuron array is fully interconnected to a two-dimensional
output array by means of a three-dimensional volume holographic
optical interconnection mask. The input plane, output plane, and optional
training plane are shown. Many variants of this geometry with similar properties
are possible.


For the case of an adaptive network, we use a variable (typically photorefractive)
holographic material for recording and implementing the interconnections. To incorporate
learning, a training plane comprising a 2-D array of nodes generates the $\delta_i$ terms (Figure
15.22). During a weight update, an exposure is made of the interference pattern between
beams emanating from the two left hand planes in the figure. Each of the two left hand
planes could be implemented using, for example, a 2-D spatial light modulator illuminated
by an expanded beam. This results in a change in the refractive index modulation
representing the current weight that is dependent on the product $\delta_i x_j$, so that with appropriate
choices of parameters, the increment in diffraction efficiency can be made proportional
to $\delta_i x_j$. Ideally, this records changes (updates) in the interconnection weights within the
hologram given by Equation (24), above, in the form of gratings situated with appropriate
angular orientation and grating period. It should be noted that generating and
recording these weight updates is not a simple matter, and care must be taken to ensure
that the appropriate interference terms are recorded and that not too much crosstalk is
inadvertently created. Recording and recall of the correct values is primarily a number
representation issue and is discussed below; undesirable crosstalk depends on the recording
and reconstruction technique as previously discussed in the subsection on “Photonic
Interconnections”.

An example of one source of holographically-induced interconnection crosstalk is an
inadvertent degeneracy of gratings. Even though each volume grating affects only the
beams incident at a particular angle with respect to the grating, it affects all of the beams
at that particular angle. Because of this, an entire cone of beams (with its axis of symmetry
aligned with the grating wave vector) can be affected by a single diffraction grating. This
degeneracy creates an undesired coupling between different interconnections in a fully
connected network. For neuron unit sources on an ideal, rectangular grid, this coupling can
be eliminated by removing neuron units from certain locations in the array, leaving sparsely
distributed neuron units arranged in a degeneracy breaking pattern. This eliminates the
undesired coupling, at the expense of a reduction in the number of neuron units from $N^2$
(for an $N \times N$ array) to $N^{1.5}$ [Psaltis, 1989].
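
To give a feel for the cost of this thinning, the sketch below tabulates the number of neuron units before and after for a few array sizes. It uses only the scaling quoted above and ignores any constant factors, so the numbers indicate orders of magnitude rather than exact counts; the function name and the chosen array sizes are illustrative assumptions.

```python
def usable_units(n):
    """Approximate neuron units left on an n x n grid after thinning (scaling only)."""
    return n ** 1.5

for n in (16, 100, 1024):
    full = n * n
    kept = usable_units(n)
    print(f"N={n:5d}  full grid={full:8d}  thinned~{kept:8.0f}  fraction={kept / full:.3f}")
```

The retained fraction falls as one over the square root of the array dimension, so the penalty grows with array size.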

The case of a non-adaptive network is likely to be an important one as well. In this case
the interconnection hologram does not have to be recorded in accordance with a specific
learning algorithm. If the weights are known a priori, then any applicable recording
technique will suffice. In many cases, however, the weights may not be known. A common
scenario may involve the training of a “master” network; once it has been trained, copies
of  the  network  could  be  produced  in  a  production  environment.  If  the  network  is  large, 
and  particularly  if  it  utilizes  volume  holographic  optical  interconnections,  then  probing  the 
values  of  all  of  the  weights  could  be  impractical.  The  most  efficient  production  means  in 
this  case  would  be  to  make  direct  copies  of  the  volume  hologram.  Thus,  the  capability  of 
rapidly  copying  a  multiplexed  volume  interconnection  hologram  is  important. 

The physical representation of the signal directly impacts the operation of a photonic
neural network. The physical quantities available for optical representation of a signal
level are field amplitude, phase, intensity, polarization, spatial position or frequency, and
wavelength. We will consider only the most likely candidates: field amplitude (with phase)
and intensity. For the case of an amplitude (with phase) representation, the signals may in
general be complex valued; bipolar signals, of course, represent a subset of these numbers,
and thus can be represented. Given that $x$ and $y$ are represented as (electric or magnetic)
field amplitudes, the resulting detected activation potential of neuron unit $i$, $p_i$, is given by

$$ p_i^{(coh)} = \left| \sum_j w_{ij} x_j \right|^2 \qquad (25) $$

for the case of a coherent sum, and by

$$ p_i^{(incoh)} = \sum_j \left| w_{ij} x_j \right|^2 \qquad (26) $$

for the case of an incoherent sum (cf. the preceding section on “Fundamental Principles
of Photonic Technology”). In both Equations (25) and (26), the weight $w_{ij}$ is represented
physically by the amplitude diffraction efficiency. The coherent sum given by Equation
(25) has the advantage of allowing for the addition of both positive and negative numbers
in computation of the neuron unit potential, as desired for the incorporation of both
excitatory and inhibitory neuron unit inputs. Clearly, Equations (25) and (26) deviate
from conventional neural network models. The effects on different neural network models
of such deviations in the summation before thresholding are not currently well understood.

If we instead encode the signal levels as intensities, the activation potential becomes

$$ p_i^{(int)} = \sum_j w_{ij} x_j \qquad (27) $$

which is the desired activation potential, but at the expense of all terms in the summation
being nonnegative. In this case the weight $w_{ij}$ is represented physically by the intensity
diffraction efficiency. A technique for effectively achieving bipolar signals in this case will
be discussed in the section describing “An Implementation Strategy”.
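
The practical difference between the three summation schemes discussed above (a coherent sum of amplitudes, an incoherent sum of amplitude-encoded signals, and a sum of intensity-encoded signals) can be seen in a small numerical comparison. The formulas in the sketch are the standard ones for detected intensity (squared magnitude of a summed field, versus a sum of squared magnitudes) and are stated here as assumptions; the function names and sample values are illustrative.

```python
def p_coherent(w, x):
    """Detected potential for a coherent sum: amplitudes add, then are squared."""
    return abs(sum(w_j * x_j for w_j, x_j in zip(w, x))) ** 2

def p_incoherent(w, x):
    """Detected potential for an incoherent sum: each term is squared, then added."""
    return sum(abs(w_j * x_j) ** 2 for w_j, x_j in zip(w, x))

def p_intensity(w, x):
    """Potential for intensity encoding: a plain weighted sum of non-negative terms."""
    return sum(w_j * x_j for w_j, x_j in zip(w, x))

# Amplitude encoding with one positive and one negative weight.
w_amp, x_amp = [1.0, -1.0], [0.5, 0.5]
print(p_coherent(w_amp, x_amp))     # contributions cancel before detection
print(p_incoherent(w_amp, x_amp))   # signs are lost, contributions add

# Intensity encoding: weights and signals are both non-negative.
w_int, x_int = [0.5, 0.25], [2.0, 4.0]
print(p_intensity(w_int, x_int))
```

Only the coherent case lets a negative weight subtract from the potential; the intensity case gives the conventional weighted sum but cannot represent inhibition directly.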

The signal representation used also impacts the nature of the weight updates. The
physical weight updates can be derived using common models of photorefractive (or other)
recording materials. Such a derivation requires a number of approximations and assumptions
to be made regarding the chosen operational mode. By appropriate choice of the
operational mode, the ideal weight update rule given by Equation (24) can be approximately
obtained for both intensity representation and amplitude representation cases. The
operational mode may not prove to be the same in each case, and may differ in such parameters
as the size of the weight updates, the size of the existing weights before the update,
and the exact characteristics of the holographic material used. The “second order” terms
that deviate from the precise form of Equation (24) are also different in the two cases;
the effect of such terms on learning algorithm performance is not well characterized or
understood, and is currently an active area of research.

So far we have discussed only a single interconnection layer with neuron units for inputs
and outputs. If such a physical network includes feedback, it can be generalized to
functionally implement an arbitrary multilayer feedforward or recurrent network. Figure
15.23 illustrates this principle, showing one physical layer of neuron units, one layer of
interconnections from the neuron units to a set of fan-in nodes, and feedback from each
fan-in node to the corresponding neuron unit. These neuron units can be conceptually
divided into groups corresponding to different functional layers. Some of the physical
interconnections then represent functionally feedforward connections (represented by solid
lines and boxes in Figure 15.23), and some represent functionally lateral connections within
a layer (represented by broken lines and boxes in Figure 15.23). Feedback connections to
previous layers, and feedforward connections that bypass the next subsequent layer, can
also be incorporated in a similar manner, but are not shown in the figure. This technique
for implementing multilayer networks using a single physical layer has been discussed by
Farhat for the case of 1-D neuron unit arrays interconnected by a 2-D mask, and used
in the implementation of parallel optoelectronic simulated annealing [Farhat, 1987]. Thus
any photonic (single physical layer) architectures discussed herein generalize to multilayer
networks, provided that they have capability for arbitrary connections and feedback.
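
The idea of folding several functional layers into one physical layer with feedback can be illustrated with a single weight matrix whose nonzero blocks encode the feedforward connections. In the sketch below, five physical neuron units are grouped into an input layer, a hidden layer, and an output layer; each trip around the feedback loop advances the signal by one functional layer. Holding the input units fixed (`clamp`), the use of `tanh`, and all numerical values are assumptions made for the illustration.

```python
import math

def step(W, s, clamp):
    """One trip around the feedback loop: s_i <- f(sum_j W_ij * s_j).

    clamp maps unit index -> fixed value for the functional input units.
    """
    new = [math.tanh(sum(w * v for w, v in zip(row, s))) for row in W]
    for i, value in clamp.items():
        new[i] = value
    return new

# Units 0,1: functional input layer; units 2,3: hidden layer; unit 4: output layer.
W_phys = [
    [0.0, 0.0, 0.0, 0.0, 0.0],
    [0.0, 0.0, 0.0, 0.0, 0.0],
    [1.0, -1.0, 0.0, 0.0, 0.0],   # hidden unit 2 <- input units
    [0.5, 0.5, 0.0, 0.0, 0.0],    # hidden unit 3 <- input units
    [0.0, 0.0, 1.0, -2.0, 0.0],   # output unit 4 <- hidden units
]
clamp = {0: 1.0, 1: 0.0}
state = [1.0, 0.0, 0.0, 0.0, 0.0]
for _ in range(2):                # two passes, one per functional interconnection layer
    state = step(W_phys, state, clamp)
print(state[4])
```

Lateral or feedback connections would appear as additional nonzero blocks on or above the diagonal of `W_phys`, with no change to the physical arrangement.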


FIGURE 15.23 A single layer physical neural network with feedback, used
to implement a multilayer recurrent functional network. The solid boxes indicate
feedforward connections, and the broken boxes indicate lateral connections.


In summary, the desirable characteristics of a photonic implementation of neural networks
include: (1) modularity, so that multiple “modules” can be cascaded; (2) capability
for lateral, feedforward, and feedback interconnections, which can be achieved physically
by use of a single layer network with feedback and arbitrary interconnection capability;
(3) analog weighted connections with analog signals; (4) bipolar signals and weights; (5)
scalability to large numbers of neuron units with high connectivity; (6) generality, so that
different neuron models, network models, and learning algorithms can be implemented
within the same basic technology; (7) compatibility of different components within a given
architecture; and (8) overall feasibility of the proposed combination of algorithm, architecture,
devices, and materials. In addition, the optical/photonic hardware would ideally
incorporate the following features: (1) simultaneous, parallel updates of all interconnection
weights at each iteration; (2) high optical throughput; (3) low interconnection crosstalk;
and (4) flexible functionality for neuron unit response, so that different neuron models and
learning algorithms can be accommodated.

AN  IMPLEMENTATION  STRATEGY 

In this section a photonic technique for the implementation of neural networks is described
that potentially satisfies the aforementioned desirable characteristics and features
[Jenkins, 1990a; Asthana, 1990a; Jenkins, 1990b; Asthana, 1990b; Jenkins, 1990c]. This
photonic neural network implementation technique utilizes optoelectronic spatial light
modulators (SLMs) for the 2-D neuron unit and training term planes. Each neuron unit
incorporates dual channel encoding to allow for the representation of bipolar input
and output signals, and comprises two integrated detectors, two modulators and integrated
electronics. The neuron unit input and output signals are represented in the optical
system by intensity. The interconnections are based on a 3-D holographic material with a
novel incoherent/coherent recording and reconstruction technique that permits simultaneous
updates of all weights during each iteration. In addition, the interconnections utilize
a unique double angular multiplexing arrangement to minimize interchannel crosstalk and
throughput losses, in which each pixel of the object beam SLM is illuminated by a set of
mutually incoherent beams, each at a different angle. This implementation technique is
explained in the remainder of this section.

A key feature of this implementation strategy is the use of an array of individually
coherent sources that are mutually incoherent to generate an array of coherent beam pairs
used for holographic recording and reconstruction in the interconnection network. Consider
the problem of recording two holograms, object $A$ recorded with reference beam $x_1$ and
object $B$ recorded with reference beam $x_2$, as shown in Figure 15.24(a). The objects $A$
and $B$ could each be a 2-D array of data. In order to write both holograms simultaneously,
$A$ and $x_1$ originate from the same coherent source and are mutually coherent; similarly
for $B$ and $x_2$. However, $B$ and $x_2$ originate from a different source than $A$ and $x_1$, so
that each pair is incoherent with respect to the other pair. In this way, there are no extra
(crosstalk) holograms written, such as that between $A$ and $x_2$, or between $x_1$ and $x_2$. This
technique can be used for more than two multiplexed holograms, in which case a separate
source is assumed for each hologram written.


FIGURE 15.24 Incoherent/coherent technique for recording and reconstructing
multiple holograms simultaneously, in which all solid lines represent mutually
coherent beams, and all broken lines represent a separate set of mutually
coherent beams: (a) recording; (b) reconstruction; and (c) holographic representation,
in which each hologram represents the fanout from a given neuron
unit.


During reconstruction, the holograms are illuminated by the same set of reference
beams $x_1$ and $x_2$. This simultaneously reconstructs the arrays $A$ and $B$ (Figure 15.24(b)).
If the arrays are in registry upon reconstruction, a pixel-by-pixel incoherent sum will be
achieved in the output array. If we now consider each reference beam $x_j$ to be the output of
a neuron unit at the input to an interconnection layer, then each reconstructed hologram
corresponds to the fan-out from one neuron unit, with a contribution to each pixel in
the output array proportional to the weight of the corresponding interconnection. This
is depicted in Figure 15.24(b) and (c), in which the two signals fanning in to a given
neuron unit are derived from separate, mutually incoherent optical sources. Note that this
technique provides an incoherent sum for the potential of each neuron unit (Equation (26)
or Equation (27), depending on the chosen representation), as desired.
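
The bookkeeping behind the incoherent/coherent technique can be sketched as follows. The first function lists which beam pairs write a grating, under the idealized assumption that two beams interfere only if they come from the same source; the second forms the pixel-by-pixel incoherent sum on readout. The beam labels follow the two-hologram example (with the two reference beams called `x1` and `x2` here), while the source labels `S1` and `S2`, the function names, and the numerical values are illustrative assumptions.

```python
from itertools import combinations

# Each beam is tagged with the source it is derived from.
beam_source = {"A": "S1", "x1": "S1", "B": "S2", "x2": "S2"}

def recorded_gratings(beam_source):
    """Beam pairs that write a hologram: only beams sharing a source interfere."""
    return [(a, b) for a, b in combinations(beam_source, 2)
            if beam_source[a] == beam_source[b]]

def reconstruct(holograms, x):
    """Incoherent, pixel-by-pixel sum of the arrays read out by each reference beam.

    holograms[j][i] is the stored weight from input j to output pixel i;
    x[j] is the intensity of reference beam j.
    """
    n_pixels = len(holograms[0])
    return [sum(h[i] * x_j for h, x_j in zip(holograms, x))
            for i in range(n_pixels)]

print(recorded_gratings(beam_source))
output = reconstruct([[0.2, 0.4], [0.1, 0.3]], [1.0, 2.0])
print(output)
```

Of the six possible beam pairs, only the two intended object-reference pairs are recorded in this idealized model, which is the sense in which no crosstalk holograms are written.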

Another critical as well as unique feature of the photonic architecture described herein
is a “double angular multiplexing” technique in which one input node or pixel in the object
beam path has multiple beams passing through it at different angles. Thus, a set of angularly
multiplexed beams is introduced for each object beam node $\delta_i$, as shown in Figure
15.25. A three-fold angularly multiplexed fan-in from $x_1$, $x_2$, and $x_3$ to yield neuron unit
potential $p_i$ is depicted in this figure; solid lines represent mutually coherent beams (all
dashed lines represent a mutually coherent set as well; similarly for mixed dashed lines).
Note that this multiplexing technique eliminates the fan-in beam degeneracy characteristic
of collinear geometries referred to above in the subsection “Photonic Interconnections”.
Thus, the ensuing cross-coupling terms are absent, and a much more accurate set of weights
can be recorded and reconstructed at each iteration.


FIGURE 15.25 Doubly angularly multiplexed volume holographic optical
interconnection, designed to circumvent the effects of beam degeneracy. The
mutually incoherent input beams ($\{x_j\}$) are angularly multiplexed over $j$, as
are the corresponding sets of output beams from the training plane ($\{\delta_i\}$)
generated by the coherent sources $S_j$, to produce an angularly multiplexed fan-in
at each summed output, thus yielding the neuron activation potentials $\{p_i\}$.


A photonic architecture for neural network implementation that utilizes these principles
is shown in Figure 15.26, for the case of Hebbian learning ($\delta_i = y_i$). The components
shown in the figure comprise one module; inputs and outputs refer to this particular module.
Only feedforward connections are shown. The upper spatial light modulator, $SLM_1$,
generates the training terms $\delta_i$ that also represent neuron unit outputs in this case. The
lower spatial light modulator, $SLM_2$, is the array of input neuron units. An array of individually
coherent but mutually incoherent sources is used to illuminate the system; they are provided
by a mutually incoherent laser diode array or by a coherent beam passing through an SLM
that temporally modulates the phase of each pixel independently. (It can be shown that
the latter method is equivalent to the former for the particular type of holographic recording
and reconstruction used herein.) A volume holographic material stores the requisite
weighted interconnections, and can implement either fixed or adaptive interconnections
depending on the material used.


FIGURE 15.26 Photonic architecture for neural network implementation
that incorporates a parallel source array, double angular multiplexing, and
incoherent/coherent recording and reconstruction; the Hebbian case is depicted.


Both  spatial  light  modulators  in  Figure  15.26  consist  of  an  array  of  pixels,  each  of 
which  comprises  three  elements:  (1)  two  integrated  detectors  for  input  of  positive  and 
negative  parts  of  the  neuron  unit  activation  potential,  (2)  integrated  electronic  circuitry 
to  provide  the  neuron  unit  (sigmoid  or  hard  threshold)  nonlinearity,  and  (3)  two  hybrid 
or  monolithically  integrated  modulators  for  separate  optical  readout  of  the  positive  and 
negative  neuron  unit  outputs.  The  SLMs,  as  shown,  are  read  out  in  transmission,  and 
have  detectors  situated  so  as  to  receive  optical  inputs  on  the  right  face  of  the  SLM. 
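
The behavior of one such dual-channel pixel can be sketched as a function from two non-negative detector inputs to two non-negative modulator drives. The sketch assumes that the bipolar potential is the difference of the two detector signals, that the nonlinearity is `tanh`, and that the positive part of the result drives one modulator while the negative part drives the other; this is one possible encoding chosen for illustration, and the function name and gain parameter are hypothetical.

```python
import math

def dual_channel_neuron(i_pos, i_neg, gain=1.0):
    """Map two non-negative detector inputs to (positive, negative) modulator drives."""
    p = i_pos - i_neg                    # bipolar activation potential
    y = math.tanh(gain * p)              # sigmoid nonlinearity (tanh is an assumption)
    return (max(y, 0.0), max(-y, 0.0))   # both optical outputs stay non-negative

print(dual_channel_neuron(2.0, 0.5))     # net excitation: positive channel active
print(dual_channel_neuron(0.5, 2.0))     # net inhibition: negative channel active
```

Both optical outputs remain non-negative intensities, yet together they carry a signed value, which is what allows bipolar signals in an intensity representation.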

In the learning phase, the shutter is open as shown schematically in Figure 15.27. Light
from each source $S_j$ is approximately collimated so that it illuminates the entire array on
$SLM_1$, at an angle dependent on the position of the $j$th source. Thus, for an $N$ by $N$ array
of sources, there are $N^2$ beams reading out the contents of $SLM_1$ simultaneously, each at
a different angle; the entire array of terms $\{y_i\}$ is encoded onto each of these beams. Each
such beam then interferes only with its corresponding reference beam $x_j$, derived from the
same source and encoded by $SLM_2$, in the holographic medium. This writes the set of
desired weight update terms $\alpha x_j y_i$.


FIGURE  15.27  Photonic  architecture  for  neural  network  implementation: 
recording  configuration.  This  configuration  implements  the  learning  function 
in  the  photonic  architecture  of  Figure  15.26.  The  sets  of  beams  emitted  from 
the  source  array  (two  are  shown)  interfere  in  the  volume  holographic  medium 
to  update  the  weights  stored  in  the  interconnection  holograms. 


During the computation phase, the shutter is closed to prevent learning as shown
schematically in Figure 15.28. The array of sources is imaged onto $SLM_2$ as a set of
readout beams, so that each individual source corresponds to one pixel (neuron unit) on
the SLM. The SLM modulates each beam so that the transmitted beam has an intensity
proportional to the output value of the corresponding neuron unit. Thus, the $j$th source
illuminates the $j$th pixel of this SLM, providing the signal that becomes a reference
beam to read out the $j$th hologram. This hologram reconstructs an array of spots, similar
to that depicted in Figure 15.24, that contribute to the input of each neuron unit in the
output plane. The optics is set up so that this array is imaged onto the detector array.
In the complete neural network architecture of Figure 15.26, additional optical elements
(mirror $M_2$, lens $L_4$, and beamsplitter $BS_2$) are used to displace the detector array plane
to the detector side of $SLM_1$, providing the neuron unit activation potentials. In addition,
this beam is sent through beamsplitter $BS_2$ to a subsequent layer in the next module or
to the output layer.


FIGURE 15.28 Photonic architecture for neural network implementation:
reconstruction configuration. This configuration implements a single forward
pass of the computing function in the photonic architecture of Figure 15.26.
The lower set of beams acts as a set of reference beams, and generates a set of
weighted output arrays that are imaged onto the detector array. Each stored
hologram is reconstructed by a single neuron unit $x_j$, and fans out with appropriate
weights to illuminate the detector array. The full set of reconstructed
holograms sums within each pixel to yield the neuron activation potentials $\{p_i\}$.


A generalized architecture that incorporates learning algorithms of the form of Equation
(24) is shown in Figure 15.29. Instead of $SLM_1$, as in Figure 15.26, this architecture
utilizes a training term ($\delta_i$) generator that is implemented via one or more optoelectronic
SLMs. In general, target values, actual neuron unit outputs $y_i$, or possibly activation
potentials $p_i$ may be provided as inputs to the training term generator. The physical arrangement
of optical beams passing through the training term generator (from left to right)
is the same as that shown passing through $SLM_1$ in Figure 15.26. Lateral and feedback
connections can be incorporated by including an optical feedback path from the output of
the hologram to the input side of $SLM_2$.
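
The role of the training term generator can be sketched as a function that turns whichever quantities are supplied into the training terms required by the chosen learning rule. Only the Hebbian case (training term equal to the neuron output) is taken from the text; the error-driven form shown for the Widrow-Hoff case (target minus output) is the standard textbook expression and is included here as an assumption. The function name, rule labels, and values are hypothetical.

```python
def training_terms(rule, y, targets=None):
    """Return the list of training terms delta_i for the chosen learning rule."""
    if rule == "hebbian":
        # Hebbian learning: delta_i = y_i.
        return list(y)
    if rule == "widrow_hoff":
        # Standard error-driven form (an assumption here): delta_i = t_i - y_i.
        if targets is None:
            raise ValueError("targets are required for the Widrow-Hoff rule")
        return [t_i - y_i for t_i, y_i in zip(targets, y)]
    raise ValueError(f"unknown rule: {rule}")

outputs = [0.5, -0.25]
print(training_terms("hebbian", outputs))
print(training_terms("widrow_hoff", outputs, targets=[1.0, 0.0]))
```

Swapping the rule changes only what the training plane displays; the recording geometry in the hologram is unaffected.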


FIGURE 15.29 Generalized photonic architecture for neural network implementation,
including provision for the generation of arbitrary training terms
($\delta_i$).


For many applications, both SLM1 and SLM2 can be fabricated using the same technology.
Let’s consider the case of a sigmoidal response with bipolar inputs and bipolar
outputs. The electronics within each neuron unit can take the difference between the two
detector inputs to yield the (bipolar) neuron potential. It can then perform the sigmoidal
nonlinearity, and send the result to appropriate (positive channel or negative channel)
modulator(s). For example, we have fabricated a number of silicon chips that integrate
the necessary control electronics with appropriate detectors. One possible circuit that has
been designed to incorporate the necessary functionality is shown schematically in Figure
15.30. Outputs from the two photodetection stages (V_in1 and V_in2) are differentially
amplified using two pairs of CMOS transistors (M1-M3, M2-M4), generating two separate
and complementary outputs. The differential amplifier has been designed to saturate for
large values of the input signal difference, producing the upper asymptotic limit behavior
characteristic  of  the  sigmoid  function.  Each  output  signal  is  then  inverted  and  clipped  by 
another CMOS transistor pair (M21-M22, M11-M12), which asymmetrizes the transfer
curve and adds the lower asymptotic limit of the sigmoid function. Finally, each output
signal is inverted yet again and shifted in level by a dual transistor sub-unity gain amplifier
stage (M23-M24, M13-M14), producing complementary output signals (V_out1 and V_out2)
that control the dual channel modulation elements. External provision is made in each
pixel (neuron unit) for the adjustment of the voltage offset (zero crossing point) of each
characteristic curve. This external bias adjustment allows for post-fabrication fine tuning
of the overall response of the circuit, given process-induced variations in device
characteristics.
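
The signal flow described above (difference of the two detector inputs, a saturating nonlinearity, and complementary outputs with an adjustable offset) can be illustrated with a purely behavioral sketch. This is not a circuit-level model of the fabricated chip: the tanh-based sigmoid, the gain, and the 0 to 5 V output range are all hypothetical choices made only for illustration.

```python
import math

def neuron_unit(v_in1, v_in2, gain=4.0, v_offset=0.0, v_low=0.0, v_high=5.0):
    """Behavioral sketch of a dual-input, dual-output sigmoidal neuron unit.

    gain, v_offset, v_low and v_high are hypothetical parameters; the
    report does not give numerical values for the fabricated circuit.
    """
    # Differential stage: only the input difference matters. v_offset plays
    # the role of the external zero-crossing adjustment.
    diff = v_in1 - v_in2 - v_offset
    # Saturating nonlinearity mapped onto the interval [0, 1].
    s = 0.5 * (1.0 + math.tanh(gain * diff))
    # Complementary outputs that would drive the two modulator channels.
    v_out1 = v_low + (v_high - v_low) * s
    v_out2 = v_low + (v_high - v_low) * (1.0 - s)
    return v_out1, v_out2

if __name__ == "__main__":
    for v1 in (0.0, 1.0, 2.0, 3.0):
        print(v1, neuron_unit(v1, 1.5))
```

Sweeping one input with the other held fixed, as in the usage lines, mimics the way a family of transfer curves is usually plotted for such a circuit.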


FIGURE  15.30  Schematic  diagram  of  a  dual-input,  dual-output  differential 
amplifier  that  effects  a  sigmoid-like  transfer  characteristic. 


A sample set of characteristic curves measured from one of these chips is shown in Figure
15.31, with the voltage on the second detector input channel as the parameter. These
curves show the differential function of the dual channel circuit, as well as the desired
sigmoidal response characteristic. A 6 x 6 array of these 100 x 100 µm neuron units has
also been fabricated with excellent uniformity.


FIGURE 15.31 Experimentally obtained transfer characteristics from the
circuit shown in Figure 15.30, showing the output voltage in both channels
(V_out1 and V_out2) as a function of one input voltage (V_in1), with the other input
voltage (V_in2) as a parameter.


The modular nature of this photonic neural net architecture can be inferred from Figure
15.29; the upper right SLM is SLM2 of the subsequent module. Feedback paths from
one module back to previous modules can be added, if desired, in a relatively straightforward
manner. Bipolar signals are incorporated by the dual channel nature of the SLMs,
with positive and negative channels for each neuron unit. Since each neuron unit has two
physical outputs and two physical inputs, each interconnection between two neuron units
physically consists of four separate weighted connections (positive modulator to positive
detector, positive modulator to negative detector, etc.). Thus, even though each physical
weight is nonnegative in value, their combination permits effective implementation of
bipolar functional weights. In fact, the extra degrees of freedom provided by four independent
weights can require revised weight update rules to ensure convergence of the learning
process [Petrisor, 1990].
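
To make the four-connection idea concrete, the following sketch shows one way (among many) in which four nonnegative physical weights can reproduce a single bipolar functional weight. The particular encoding chosen here, and the helper names split, four_weights and potential, are illustrative assumptions and are not taken from the report.

```python
def split(value):
    """Encode a bipolar value as a nonnegative (positive, negative) channel pair."""
    return (value, 0.0) if value >= 0 else (0.0, -value)

def four_weights(w):
    """One possible nonnegative encoding of a bipolar functional weight w.

    Keys are (modulator channel, detector channel). This specific
    assignment is an assumption made for illustration.
    """
    w_pos, w_neg = split(w)
    return {
        ("+", "+"): w_pos, ("-", "-"): w_pos,   # same-sign paths carry w > 0
        ("+", "-"): w_neg, ("-", "+"): w_neg,   # cross paths carry |w| when w < 0
    }

def potential(y, w):
    """Neuron potential contributed by output y through functional weight w."""
    y_pos, y_neg = split(y)                      # dual-channel modulator outputs
    k = four_weights(w)
    d_pos = k[("+", "+")] * y_pos + k[("-", "+")] * y_neg   # positive detector
    d_neg = k[("+", "-")] * y_pos + k[("-", "-")] * y_neg   # negative detector
    return d_pos - d_neg                         # electronics take the difference

if __name__ == "__main__":
    print(potential(-1.0, 0.3), potential(0.5, -2.0))
```

Because only the differences between pairs of physical weights matter in this encoding, many different sets of four weights give the same functional weight, which is the extra freedom mentioned in the text.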

With the current spatial light modulator design at 100 µm x 100 µm per neuron unit,
10^4 neuron units per cm^2 can be implemented on each SLM. By constructing an SLM as
a mosaic of such arrays, a 3 in x 3 in SLM could implement approximately 5 x 10^5 neuron
units. Each “tile” or small array within such a mosaic need not be carefully aligned with
respect to the other tiles, as the optical system just images the array back onto itself; in the
case of multiple modules or lateral/feedback connections, there is, however, a requirement
that all SLMs are similarly tiled, within an appropriate tolerance figure. Note that the
current design utilizes only 2 µm feature sizes in CMOS; this could eventually be scaled
down  by  a  factor  of  4  in  each  dimension,  yielding  more  than  an  order  of  magnitude  increase 
in  the  number  of  neuron  units  implementable  per  unit  chip  area  (or  an  equivalent  reduction 
in  the  overall  size  with  the  same  number  of  neuron  units). 
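
The density figures in this paragraph follow from simple area arithmetic, sketched below. The function name neuron_units is hypothetical, and the sketch assumes that scaling the feature size by a factor of 4 shrinks the neuron unit pitch by the same factor.

```python
CM_PER_INCH = 2.54

def neuron_units(side_cm, pitch_um):
    """Number of square neuron units of the given pitch on a square of side side_cm."""
    units_per_cm = 1.0e4 / pitch_um          # 1 cm = 10^4 micrometers
    return (side_cm * units_per_cm) ** 2

density = neuron_units(1.0, 100.0)                 # units per cm^2 at 100 um pitch
mosaic = neuron_units(3 * CM_PER_INCH, 100.0)      # 3 in x 3 in mosaic
scaled = neuron_units(1.0, 100.0 / 4)              # pitch reduced by a factor of 4

if __name__ == "__main__":
    print(f"{density:.3g} units per cm^2")
    print(f"{mosaic:.3g} units on a 3 in x 3 in mosaic")
    print(f"density gain after scaling: {scaled / density:.0f}x")
```

The factor of 16 obtained for the scaled design is what the text describes as more than an order of magnitude.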

It  should  now  be  clear  that  this  architecture  can  be  generalized  to  implement  certain 
other  neural  models.  The  use  of  electronic  circuitry  for  the  neuron  unit  function  and 
training  term  generation  provides  significant  inherent  flexibility.  For  example,  we  have 
completed  preliminary  designs  of  units  for  forward  and  backward  propagating  signals  in 
a  backpropagation-style  multilayer  neural  network.  Although  the  optical  weight  updates 
in the holographic medium are restricted to outer-product terms (Equation (24)) in the
architecture as shown, variants of the architecture may permit other learning scenarios.


Finally,  we  consider  the  important  question  of  making  duplicates  of  a  network  that 
has already been trained. Since a volume hologram may store on the order of 10^10
independent weighted interconnections, the preferred technique is to make direct copies of
the multiplexed volume hologram. Here we describe a technique for copying such a
multiplexed volume hologram in one step [Jenkins, 1990c]. To our knowledge this has never
previously been achieved, but the use of incoherent/coherent holographic recording and
reconstruction makes this in principle quite straightforward. Figure 15.32 shows an optical
setup  for  duplicating  the  hologram.  The  master  hologram  is  illuminated  with  the  same 
set  of  reference  beams  as  those  employed  during  exposure;  all  of  the  mutually  incoherent 
reference  beams  illuminate  the  master  volume  hologram  simultaneously,  recalling  all  of  the 
stored holograms in parallel. The source array is imaged so that it generates an identical
set of reference beams on the secondary (copy) holographic medium. Similarly, the
reconstructed  object  beams  are  also  imaged,  so  that  they  are  incident  on  the  secondary 
holographic  medium,  with  amplitude  and  phase  identical  to  that  during  recording  of  the 
master  hologram.  The  appropriate  pairs  of  beams  interfere  in  the  secondary  holographic 
medium,  making  a  complete  copy  of  the  original  hologram.  (As  shown  in  Figure  15.32,  the 
copy  will  actually  be  a  spatially  inverted  version  of  the  original.  A  slight  variant  of  the 
optical  system  depicted  in  the  figure  can  produce  a  copy  that  is  identical  to  the  original.) 
Thus it is conceivable to mass produce copies of a previously trained interconnection
pattern, without ever knowing exactly what the interconnection weights are.


FIGURE 15.32 Optical layout for copying the entire contents of a three-dimensional
volume holographic optical element (VHOE) into a second VHOE,
utilizing a two-dimensional array of individually coherent, but mutually
incoherent sources.


We  conclude  this  section  with  a  brief  summary  of  the  current  implementation  status 
of this particular photonic approach to neural network fabrication. For the neuron unit
arrays, 6 x 6 arrays of dual-channel detectors integrated with neuron function electronics
have been fabricated in silicon and operate correctly. Individual multiple quantum well
(InGaAs/GaAs) modulators have been successfully fabricated and tested, and exhibit
drive voltages compatible with the electronics. The novel doubly angularly multiplexed
incoherent/coherent interconnection technique has been tested experimentally at the level of
two inputs/two outputs, and simulated at the level of four inputs/four outputs, all with very
favorable results [Jenkins, 1990c; Asthana, 1990b; Asthana, 1990c]. In addition, several
learning algorithms that incorporate some of the unique features of the optical hardware
have been successfully designed and simulated. Large 2-D arrays of laser diodes that are
not mutually coherent have been fabricated recently [Jewell, 1990; Orenstein, 1990a; Von
Lehmen, 1990]. Photorefractive crystals are routinely grown commercially, and can be
purchased from vendors for use at visible as well as infrared wavelengths. In addition,
the basic requisite features of the doubly angularly multiplexed incoherent/coherent
holographic recording techniques have been demonstrated in single crystals of bismuth silicon
oxide (Bi12SiO20), though not as yet at infrared wavelengths compatible with both the
laser diode source array and the multiple quantum well spatial light modulators. All of the
other  components  in  the  architecture  (lenses,  beamsplitters,  etc.)  are  essentially  available 
off  the  shelf. 

As  with  any  research  project  in  progress,  several  questions  pertaining  to  the  photonic 
approach outlined herein remain partially unanswered. Consider, for example, the
incoherent/coherent source array. Given the current state of the art of laser diode arrays, the
total  power  dissipation  will  limit  the  number,  maximum  optical  power,  and  spacing  of 
the  individual  sources.  Cross-coherence  among  the  sources  can  cause  undesirable  crosstalk 
among  corresponding  interconnections,  although  in  some  neural  network  models  a  small  to 
moderate  degree  of  interconnection  crosstalk  is  not  likely  to  cause  intolerable  degradation 
in  performance.  Fortunately,  a  larger  spacing  of  sources  implies  that  each  laser  can  output 
a  higher  power,  and  also  assures  a  higher  degree  of  mutual  incoherence.  Other  remaining 
questions include the achievable contrast ratio and uniformity of the spatial light
modulators; suitable monolithic or hybrid techniques for integrating detectors, electronics and
modulators; optimization of the learning algorithm relative to the chosen holographic
material's storage and erasure time constants; and linearity and limitations of the hologram
copying  process.  The  next  section  discusses  fundamental  and  technological  limitations  of 
the  photonic  hardware  and  their  impact  on  the  performance  of  photonic  neural  network 
architectures. 

FUNDAMENTAL  PHYSICAL  AND  TECHNOLOGICAL 
LIMITATIONS  OF  NEURO-OPTICAL  COMPUTATION 


Even though we are relatively early on in the development of viable neuro-optical
computing systems, it is not too early to begin asking questions about the ultimate boundaries
that may impact our future achievements. This line of inquiry can have a two-fold
impact. First, discovery of inherently fundamental physical limitations that affect all forms
of computation can, if correctly applied to the neural computational paradigm, both
provide us with an ultimate goal worthy of achievement, and perhaps warn us in advance of
architectural choices that will prove unworthy of technological implementation. Second,
careful analysis of the technological limitations (device performance boundaries within a
given technological implementation) that affect system performance can provide us with
necessary guidance in choosing among many possible implementation strategies. The goal,
of course, is to come up with the right combination of implementation strategy and
technological choices to achieve the highest computational throughput (or perhaps learning
rate) based on any one of a number of metrics. In this section, then, we discuss both the
fundamental  physical  and  technological  limitations  that  impact  the  future  performance  of 
neuro-optical  computational  systems. 


The  Energy  Metric 

Your  brain  is  truly  a  remarkable  instrument  from  a  computational  point  of  view  (as  well 
as from many other points of view!). Although estimates (as well as individuals!) vary, it
is thought that your brain consists of about 10^11 neurons, each interconnected (in certain
regions of the brain) to 10^3 to 10^4 other neurons [Changeux, 1985; Dowling, 1987; Hubel,
1979]. The human brain exhibits both short and long term memory, performs sophisticated
image analysis in fractions of a second, operates as an effective associative memory
integrated over a whole lifetime of learning, and yet operates on a power budget that is
only a fraction of the power dissipated by the average light bulb in your home [Iversen,
1979]. In order to accomplish this, the active switching elements, the neurons, operate
at  an  average  power  level  about  seven  orders  of  magnitude  lower  than  that  characteristic 
of  VLSI  logic  circuits  [Mead,  1989b].  If  this  were  not  possible,  it's  likely  that  you'd  be 
running  a  temperature  even  without  the  flu! 

This  discussion  points  to  one  of  many  possible  metrics  by  which  computational  systems 
can  be  judged:  energy  (or  power)  dissipation.  In  fact,  many  modern  supercomputers  are 
limited  in  performance  precisely  because  of  power  dissipation  boundaries,  or  the  ability  to 
extract  the  heat  generated  by  the  computational  process  from  the  volume  used  to  perform 
the work. We can perhaps think of computation as broken down into three fundamental
parts: representation of information, implementation of computational complexity, and
detection of the results. From the energy metric point of view, everything costs energy:
what goes in costs energy, what comes out costs energy, and what goes on in between costs
energy too. The trick in building the computational engines of the future (neural or
otherwise) will be to maximize the overall performance with a minimum expenditure of energy.


Some  Quantum  Limitations 


By representation of information, we mean the choice of data representation on which
computations are performed. Some examples might include the binary representation, M-ary
representations, an analog representation, or the residue representation [Huang, 1979].
This choice has implications at the fundamental level for the energy cost to represent a
number within a given probability of error. For example, if we detect an optical signal
bit that is binary encoded with a so-called “ideal” detector that can tell the difference
between receiving exactly zero photons and one or more photons, it only takes ten photons
on the average to guarantee that the signal is received with a probability of error of one
part in a billion, or a “bit error rate (BER)” of 10^-9. The average photon at a typical
optical communications wavelength of 1300 nm has an energy of only 1.5 x 10^-19 joules, so
the total energy cost per bit is 1.5 attojoules (1.5 x 10^-18 joules). For a communications
channel operating at 10 gigabytes (8 x 10^10 bits) per second, this implies a power dissipation
due to representation cost alone (without worrying yet about the transmission or detection
of the information) of only 0.12 microwatts. For currently available detectors, about a
thousand photons are required to achieve the same BER, so the necessary representation
cost increases to 12 microwatts. In most currently envisioned communications systems,
this cost is overwhelmed by other factors.
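
The arithmetic behind these representation costs can be checked with a few lines of code. The sketch below uses the standard relation E = hc/λ for the photon energy; the function names are hypothetical, and the bit rate is taken as 10 gigabytes per second expressed in bits per second.

```python
H = 6.62607015e-34   # Planck constant, J*s
C = 2.99792458e8     # speed of light in vacuum, m/s

def photon_energy(wavelength_m):
    """Energy of a single photon, E = h*c/wavelength, in joules."""
    return H * C / wavelength_m

def representation_power(photons_per_bit, bit_rate, wavelength_m=1300e-9):
    """Optical power needed just to represent the bits, in watts."""
    return photons_per_bit * photon_energy(wavelength_m) * bit_rate

bit_rate = 10e9 * 8                                # 10 gigabytes per second, in bits per second
ideal = representation_power(10, bit_rate)         # ideal detector, 10 photons per bit
practical = representation_power(1000, bit_rate)   # about 1000 photons per bit

if __name__ == "__main__":
    print(f"photon energy at 1300 nm: {photon_energy(1300e-9):.3g} J")
    print(f"ideal detector: {ideal * 1e6:.2f} microwatts")
    print(f"practical detector: {practical * 1e6:.1f} microwatts")
```

With these inputs the sketch gives roughly 0.12 and 12 microwatts, in line with the figures quoted in the text.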

But  what  if  we  chose  to  represent  numbers  in  an  analog  representation  instead?  If  we 
were  to  follow  the  same  kinds  of  quantum  statistical  rules,  we  would  find  that  to  represent 
the number “1000”, say, with an effective bit error rate of 10^-9 requires about 150 million
photons  [Tanguay,  1988].  This  is  about  15  million  times  larger  than  the  representation  cost 
of  a  single  binary  bit,  and  about  1.5  million  times  larger  than  the  binary  representation 
cost  of  the  number  1000. 

If  we  assume  that  the  analog  representation  need  only  cover  numbers  between  0  and 
1000,  then  the  dependence  of  the  probability  of  error  on  the  number  of  photons  used  to 
represent  the  highest  number  (1000)  is  given  in  Figure  15.33.  Interestingly,  even  if  we  are 
willing  to  give  up  on  a  couple  of  orders  of  magnitude  of  error  probability,  our  energy  cost 
isn’t  reduced  very  much.  In  fact,  it  costs  about  27  million  photons  to  represent  1000  with 
1%  error,  and  about  11  million  photons  to  represent  it  with  as  much  as  10%  error.  These 
numbers  can  all  be  reduced  by  about  two  orders  of  magnitude  if  we  are  willing  to  give  up 
a  factor  of  ten  in  dynamic  range,  limiting  the  highest  representable  number  to  100  instead 
of  1000,  as  shown  in  the  Figure. 
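
The report cites [Tanguay, 1988] for these photon counts and does not spell out the underlying model. As an illustration only, the sketch below uses one simple model that happens to reproduce the quoted numbers: the full-scale level is represented by N photons, shot noise is approximated as Gaussian with standard deviation sqrt(N), and an error occurs when the detected count strays by more than half a grey-level spacing. The model and the function name photons_required are assumptions, not the author's stated method.

```python
from statistics import NormalDist

def photons_required(levels, p_error):
    """Photons needed at full scale so that shot noise misreads the level
    with probability p_error (two-sided Gaussian approximation).

    Spacing between levels is N/levels and the noise is sqrt(N), so the
    error threshold in standard deviations is z = sqrt(N) / (2 * levels).
    """
    z = -NormalDist().inv_cdf(p_error / 2.0)   # z such that 2*Q(z) = p_error
    return (2.0 * levels * z) ** 2

if __name__ == "__main__":
    for levels in (1000, 100):
        for p in (1e-9, 1e-2, 1e-1):
            print(f"{levels} levels, P(Err) = {p:g}: "
                  f"{photons_required(levels, p):.3g} photons")
    # Comparison with binary: 10 photons per bit, 10 bits to encode 1000.
    n = photons_required(1000, 1e-9)
    print(f"ratio to one bit: {n / 10:.3g}, ratio to 10 bits: {n / 100:.3g}")
```

Under this model the photon count grows as the square of the number of grey levels, which is why reducing the dynamic range by a factor of ten saves two orders of magnitude.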


FIGURE  15.33  The  single  pixel  probability  of  error  P(Err)  as  a  function 
of the number of photons detected within each pixel, for the cases of 100 and
1000 analog grey levels.


These  errors  arise  fundamentally  from  the  quantum  statistical  nature  of  light,  and  from 
the  fact  that  we  just  can't  guarantee  the  number  of  photons  in  a  packet  of  light  (without 
resorting to exotic things like “squeezed states”, which have their own practical limitations
as well as other costs). In the brain, of course, it is currently thought that many (but
not  all)  of  the  quantities  involved  in  signal  transmission,  both  electrical  and  chemical,  are 
analog  in  nature. 

The  Incorporation  of  Computational  Complexity 


Given  the  fact  that  it  is  considerably  more  expensive  to  represent  quantities  in  analog 
as opposed to binary form, why don’t we always choose to compute in the binary
representation? The answer is that many operations are less energy consumptive to perform
in the binary representation, but others are not. The difference lies in the degree of
computational complexity that can be implemented on a given representation for a particular
computational  operation  within  a  chosen  technological  implementation.  For  our  purposes 
here,  we  may  define  the  computational  complexity  of  a  given  operation  as  the  minimum 
number  of  irreducible  binary  bit  operations  (over  all  possible  computational  algorithms 
and  machine  architectures)  required  to  complete  the  calculation  assuming  that  the  data  is 
represented  in  binary  throughout. 

For  operations  of  low  computational  complexity  such  as  transferring  data  from  the  CPU 
to  memory  or  logic  and  control  operations,  computation  in  the  binary  representation  tends 
to  have  a  significant  energy  consumption  advantage  at  the  fundamental  limits  (as  well  as 
at  the  current  technological  limits  for  both  electronic  and  photonic  processors).  On  the 
other  hand,  for  operations  of  high  computational  complexity  such  as  the  two-dimensional 
Fourier  transform  that  require  a  very  large  number  of  irreducible  binary  operations  to 
perform  (for  the  optimum  algorithm),  computation  in  the  analog  representation  tends  to 
exhibit lower overall energy consumption, particularly in photonic implementations.


The  Hybrid  Representation  Concept 


In the case of neural networks, a number of characteristic types of computational
operations are typically employed, including for example the calculation of weight updates
and  storage  of  updated  weights,  the  fan-out  of  neuron  unit  outputs,  the  multiplication 
of  fanned  out  outputs  by  weights,  the  communication  of  weighted  signals,  the  fan-in  and 
summation  (or  differencing)  of  weighted  inputs,  and  the  thresholding  of  summed  inputs  to 
form  neuron  unit  outputs.  These  operations  span  a  wide  gamut  of  computational  as  well  as 
physical  complexity.  As  such,  we  suggest  that  optimum  neural  network  performance  from 
an  energy  metric  viewpoint  may  turn  out  to  be  best  achieved  with  a  hybrid  representation, 
in  which  the  signal  representation  is  essentially  binary  for  certain  functions,  and  essentially 
analog  for  others. 

An example of the use of this hybrid representation concept is the use of a hard
threshold function within each neuron unit to create a two state output (on and off), and the use
of  analog  holographic  storage  for  all  interconnection  weights,  as  described  in  a  previous 
section.  In  this  case,  the  inputs  to  each  neuron  unit  are  analog,  while  the  outputs  from  each 
neuron  unit  are  binary.  Multiplications  are  performed  in  a  fully  hybridized  representation 
(multiplicands  are  analog  while  multipliers  are  binary),  but  summations  are  fully  analog 
(the superposition of fanned-in input intensities). Given a particular choice of implementation
technology, then, the overall power budget for a given hybrid representation can
be  established  and  compared  with  similar  power  budgets  for  fully  analog  and  fully  binary 
representations. 
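
A small numerical sketch may help to fix the idea of the hybrid representation: binary neuron outputs gate analog weights, the gated weights are summed in analog fashion, and a hard threshold returns a binary output. The function name hybrid_layer and the example weights and thresholds are hypothetical values chosen only for illustration.

```python
def hybrid_layer(binary_inputs, analog_weights, thresholds):
    """One layer in a hybrid representation.

    binary_inputs  : list of 0/1 neuron outputs from the previous layer
    analog_weights : one row of nonnegative analog weights per neuron unit
    thresholds     : one hard threshold per neuron unit
    """
    outputs = []
    for row, theta in zip(analog_weights, thresholds):
        # Hybrid multiplication: a binary multiplier simply gates the analog
        # multiplicand on or off. The sum is the analog fan-in.
        potential = sum(w for w, x in zip(row, binary_inputs) if x)
        # Hard threshold: analog potential in, two-state output out.
        outputs.append(1 if potential >= theta else 0)
    return outputs

if __name__ == "__main__":
    weights = [[0.2, 0.9, 0.5],
               [0.1, 0.1, 0.1]]
    print(hybrid_layer([1, 0, 1], weights, [0.6, 0.6]))
```

In an optical implementation the gating corresponds to a modulator being on or off, and the summation to the superposition of intensities on a detector.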


The  Inherent  Costs  of  Interconnections 


Regardless of the representation chosen, it is clear that any computational energy budget
must take into account the non-negligible cost of the interconnections themselves.
Interconnections characteristic of neural networks are merely a form of weighted
communication channels, characterized by a high degree of fan-out and fan-in. For both electronic
and  photonic  implementations  that  have  adaptive  weights,  it  takes  energy  to  calculate  the 
weight  updates,  it  takes  energy  to  store  the  resultant  updated  weights,  it  takes  energy  to 
perform  the  multiplications  implied  by  the  weighting  of  the  output  signals,  and  it  takes 
energy  to  communicate  the  various  signals  between  layers. 

In  the  electronics  case,  these  energy  costs  derive  from  charging  up  the  capacitances  of 
switching devices in the various forms of memory, flipping switches in the various arithmetic
operations (both addition and multiplication) for binary representations, operating
linear and nonlinear devices for analog representations, and charging and discharging the
capacitance associated with output line drivers as well as the capacitance associated with
the physical interconnections (wires) among the various parts of the circuit. In the photonics
case, the comparable energy costs derive from the generation of light by coherent
optical sources, the holographic recording of weights and weight updates, the throughput
losses engendered by readout of the holographically stored (and multiplexed) interconnection
matrix, any throughput losses associated with the fan-out and fan-in processes, and
the inter- and intralayer communication costs.

The  bottom  line  is  that  in  most  cases,  complex,  highly  multiplexed  interconnection 
networks  with  a  high  degree  of  fan-in  and  fan-out  are  very  energy  consumptive,  and  for 
the  neural  network  case  may  prove  to  be  the  largest  energy  sink. 


The  Inherent  Cost  of  Detection  (Switching) 


No  useful  computational  system  can  avoid  the  costs  of  detection,  both  of  intermediate 
results  that  are  essential  to  following  calculations,  and  of  the  sought-after  answers  or  output 
states  that  initiate  subsequent  actions  or  analysis.  And  this  is  the  cost  above  all  costs  that 
we  are  certainly  willing  to  pay,  as  answers  or  outputs  validate  the  usefulness  of  the  system 
and  its  design.  As  pointed  out  earlier,  there  are  three  primary  areas  in  which  detections 
are essential: in the generation of summed inputs prior to functional transformation into
individual  neuron  unit  outputs  (usually  on  the  input  side  of  an  optically  addressed  spatial 
light  modulator),  in  the  generation  of  specific  system  outputs,  and  in  the  holographic 
recording  of  interconnection  weights. 

The physical process of detection inherently involves the dissipation of finite energy (if
accomplished in finite time within prescribed uncertainties, as is appropriate for
computation), since it necessarily involves the irreversible switching of the state of a physical
component, as well as the guarantee that the switched state will be maintained over the
time period of measurement without fluctuations due, for example, to thermally induced
statistical variations. This is particularly true in highly distributed computing systems
such as neural networks that depend on a certain degree of predictability and
synchronization of communicated results for progressive computation. As such, this is not really
a  fundamental  physical  limitation,  but  rather  is  a  technological  limitation  imposed  by  the 
system  designer,  who  would  really  like  to  see  some  intelligible  output  from  the  system  in 
the  near  future. 


Optimization  of  the  Computational  Architecture 


The design of a computational architecture in many ways fixes the fundamental performance
limitations of the system, as choices must be made about the representation of data
within  the  architecture,  the  methods  employed  for  the  implementation  of  computational 
complexity,  and  the  frequency  and  nature  of  the  detections  required  for  both  intermediate 
and  final  results.  For  neural  networks  capable  of  sophisticated  operations,  optimization 
of  the  computational  architecture  against  one  or  more  metrics  (such  as  total  energy  cost 
for  a  given  computation,  or  total  power  at  a  given  operating  frequency)  will  necessitate 
an  appropriate  balance  among  the  various  representations  employed,  as  well  as  among 
the  physical  mechanisms  employed  to  accomplish  the  necessary  computations.  A  further 
balance  must  be  struck  between  the  fraction  of  the  computational  burden  that  is  assigned 
to interconnections, and the fraction that is accomplished by switching (whether logic,
arithmetic, or detection of results).

For  many  classes  of  computational  problems,  the  neural  network  paradigm  may  prove 
to  be  nearly  optimal  even  in  the  regime  in  which  all  of  the  individual  components  are 
assumed to be operating at their respective fundamental physical performance boundaries.
Relative to a modern digital supercomputer, certainly, neural networks seemingly
offer an unusual mix of hybrid representations (primarily analog), interconnections (highly
multiplexed as well as weighted), and switching (infrequent relative to the rate at which
interconnections  are  utilized).  It  is  at  the  very  least  intriguing  to  imagine  whether  or  not 
our  biological  heritage  has  stored  within  it  a  useful  clue  about  highly  efficient  computation 
for  truly  sophisticated  problems. 


Technological  Limitations 


The  choice  of  a  technological  base  (or  bases)  within  which  to  design  a  neural  network 
with a large number of neuron units and a high degree of connectivity implies yet another
set of performance constraints above and beyond the fundamental physical boundaries
referred to above. These technological limitations may not yet have been reached within
the development of a given technology, but can at least be estimated given what we
know about the physics of operation of the devices in question. One such technological
limitation,  for  example,  governs  the  total  energy  dissipation  density  that  can  be  tolerated 
on  a  given  semiconductor  substrate  without  either  an  unacceptable  temperature  rise  that 
affects  device  performance,  or  resort  to  extraordinary  cooling  measures  (that  may  prove  to 
be  unfeasible  in  an  optical  path).  As  a  second  example,  the  energy  required  to  represent 
a  single  bit  using  current  digital  logic  circuits  integrated  in  silicon  is  about  seven  orders  of 
magnitude  above  the  thermal  fluctuation  limit  [Tanguay,  1988]. 
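
To give a rough sense of scale for the second example, the sketch below evaluates a commonly used expression for the thermal limit on the energy per bit, kT ln 2, at room temperature and multiplies it by seven orders of magnitude. The report does not state which expression [Tanguay, 1988] uses for the thermal fluctuation limit, so both the kT ln 2 form and the 300 K temperature are assumptions made here for illustration.

```python
import math

K_B = 1.380649e-23   # Boltzmann constant, J/K

def thermal_limit(temperature_k=300.0):
    """Assumed thermal limit per bit, k*T*ln(2), in joules."""
    return K_B * temperature_k * math.log(2.0)

limit = thermal_limit()            # a few zeptojoules at 300 K
logic_estimate = limit * 1.0e7     # seven orders of magnitude above the limit

if __name__ == "__main__":
    print(f"kT ln 2 at 300 K: {limit:.3g} J")
    print(f"seven orders of magnitude higher: {logic_estimate:.3g} J per bit")
```

Under these assumptions the implied energy per bit for digital logic is of the order of tens of femtojoules.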

In  a  previous  section  of  this  chapter,  we  discussed  a  particular  photonic  implementation 
strategy  for  neural  networks  that  involved  specific  technological  choices  for  the  various 
types  of  components  required  by  the  architecture.  At  the  present  state  of  development  of 
photonic computational systems, we do not have the luxury of designing everything within
a single technological base, as is the case perhaps for computational subsystems based on
VLSI chips. “Optical silicon” has not yet emerged, or at the very least has not yet been
identified and recognized as such, even though numerous candidates have been intensively
investigated.

Perhaps the leading candidate at the current time is the compound semiconductor
system based on gallium arsenide (GaAs) and including related ternary compounds such
as indium gallium arsenide (In_xGa_1-xAs) and aluminum gallium arsenide (Al_xGa_1-xAs).
Within this system, at least, sources, source arrays, spatial light modulators, integrated
electronic circuitry, volume holographic optical elements, detectors, and detector arrays
have all been fabricated and evaluated with varying degrees of success. What has not
been established to date is the mutual compatibility of all of these elements operating
within a given systems context. This demonstration of mutual compatibility in all relevant
performance specifications is essential, because in a highly interconnected system
the overall performance achieved is often most strongly influenced by the component with
the least desirable characteristics. An obvious example is that of a single channel optical
communications link, for which the transmission bandwidth will be delimited by the
lowest bandwidth component among the source/modulator, transmission medium, and
detector/amplifier.

In the remainder of this section, we briefly discuss a number of the types of technological
limitations that will impact the performance of currently envisioned photonic
implementations of neural networks.

With regard to sources and source arrays, the principal technological issues are the
minimization of laser thresholds to allow for parallel operation of a large number of sources
on a single chip, the coherence length achievable with ultrashort cavity surface emitting
lasers when fabricated in an array (which impacts the holographic recording process), the
uniformity of wavelength across the array (particularly for parallel readout of wavelength
sensitive devices such as multiple quantum well spatial light modulators), and both the
short term (process-determined) yield and the long term reliability of individual sources
within a large scale array.

For the case of spatial light modulators, key technological issues include the maximum
density of neuron units that can be integrated on a monolithic or hybrid chip with appropriate
detectors, control circuitry, and modulators within each pixel; the sensitivity to
input  intensity;  the  neural  unit  functionality  (and  perhaps  programmability)  that  can  be 
achieved  at  the  minimum  cost  in  real  estate  and  energy  dissipation;  the  contrast  ratio  and 
uniformity  of  the  contrast  ratio  across  the  array  of  pixels  (neuron  units);  the  achievable 
dynamic  range  of  the  input/output  transfer  function;  and  the  operational  bandwidth  that 
can  be  reached  assuming  a  50%  duty  cycle  for  each  neuron  unit  (which  determines  the 
total  power  dissipation  of  the  chip). 

The  high  degree  of  interconnectivity  envisioned  for  photonic  implementations  of  neural 
networks  hinges  primarily  on  the  achievement  of  appropriate  functionality  in  the  volume 
holographic  optical  elements  used  to  record  and  store  interconnection  weights,  produce 
fan-in  and  fan-out  from  each  neuron  unit,  and  allow  for  highly  parallel  readout  of  the 
weighted  interconnection  network.  Key  technological  limitations  for  currently  investigated 
photorefractive  materials  include  the  optical  quality  routinely  achievable  in  large  (1  cubic 
inch)  single  crystal  samples,  the  storage  capacity  of  the  medium  as  determined  by  the 
highest  spatial  frequency  gratings  that  can  be  recorded,  the  sensitivity  for  recording  of 
updated  weights  [Johnson,  1988]  at  the  source  wavelength  (which  in  turn  determines  the 
source power necessary to initiate weight updates during the learning phase), the potential
for "fixing" of the stored interconnection weights to allow for nondestructive readout
during computation, and the capability of copying the contents of the stored interconnection
matrix into another holographic medium (in order to provide the capacity for mass
production of fixed pattern interconnections following a training sequence executed with a
dynamic medium). Perhaps the most important technological limitation of a given holographic
recording medium will prove to be the total number of weight update cycles that
can  be  initiated  without  complete  erasure  of  the  weight  updates  recorded  during  the  very 
first  training  cycle.  This  number  in  effect  sets  an  upper  bound  on  the  learning  capacity  of 
the  photonic  neural  network. 

The primary technological limitations of importance to single pixel detectors as used,
for example, on the input side of optically addressed spatial light modulators, include the
sensitivity of the detector/amplifier combination (which together with the spatial light
modulator gain determines the overall loop gain for a single computational iteration), the
modulation  bandwidth  in  conjunction  with  following  circuitry,  and  the  chip  area  required 
to  achieve  the  desired  sensitivity  and  bandwidth  tradeoff.  Depending  on  the  technological 
base  within  which  the  spatial  light  modulator  is  fabricated,  the  potential  for  integration 
with  control  circuitry  and  in  some  cases  the  modulation  elements  themselves  provides  an 
additional  constraint. 

For  detector  arrays,  many  of  the  same  issues  apply  with  the  additional  constraints  of 
uniformity  of  each  performance  parameter  across  the  array,  and  the  reliability  of  the  full 
array  of  pixels.  Another  important  technological  issue  is  the  frame  rate  for  readout  of  the 
entire  array  at  a  given  pixel  density,  which  is  determined  both  by  the  technological  base 
within  which  the  array  is  fabricated,  and  by  the  physical  structure  of  the  array  and  its 
readout  configuration.  As  discussed  in  an  earlier  section,  charge-coupled-device  (CCD) 
arrays  are  typically  read  out  by  temporally  multiplexing  the  contents  of  a  full  frame  onto 
one  or  a  few  high  bandwidth  serial  outputs.  The  contents  of  each  row  of  stored  charge 
packets  generated  during  the  exposure  cycle  are  shifted  out  to  a  high  speed  serial  readout 
buffer,  which  reads  out  one  entire  column  of  pixels  in  between  each  lateral  shift  of  the  rows 
as  shown  schematically  in  Figure  15.34. 


FIGURE 15.34 Illustration of parallel-to-serial conversion in two-dimensional
detector arrays such as the charge-coupled-device (CCD) array. Charge accumulated
within each photosensitive region during exposure is transferred laterally
by a set of row-parallel shift registers to a high speed parallel-to-serial
shift register, which reads out the entire array one column at a time.
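
The readout sequence described above can be sketched in a few lines of code. This is a minimal illustration rather than a model of any particular device: the function name is hypothetical, and we assume that the serial buffer sits along the right-hand edge of the array and is emptied from the first row to the last.

```python
from typing import List


def ccd_serial_readout(frame: List[List[int]]) -> List[int]:
    """Return pixel values in the order they leave the serial output.

    Each row acts as a lateral shift register feeding one cell of the
    column-wise readout buffer. After every lateral shift the buffer holds
    one full column, which is then clocked out serially.
    Assumption: the buffer is located at the right-hand edge of the array.
    """
    rows = [list(r) for r in frame]      # copy so the input is not modified
    n_cols = len(rows[0]) if rows else 0
    output: List[int] = []
    for _ in range(n_cols):
        # Lateral shift: the pixel nearest the buffer leaves every row in parallel.
        buffer = [row.pop() for row in rows]
        # Serial shift: the buffer empties one pixel per clock cycle.
        output.extend(buffer)
    return output


if __name__ == "__main__":
    print(ccd_serial_readout([[1, 2, 3], [4, 5, 6]]))
```

The number of serial clock cycles per frame equals the total number of pixels, while the row registers shift only once per column.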


This  parallel-to-serial  conversion  limits  the  frame  readout  rate  to  the  maximum  serial 
transfer  rate  achievable  in  the  readout  buffer,  divided  by  the  number  of  pixels  in  the  array. 
For example, if the array is 2048 x 2048 pixels in size with a readout buffer operating at
200 MHz, the frame rate will be limited to about 48 frames per second. For many neural
network applications, this frame rate may be more than sufficient for access to the desired
outputs  (including  the  time  required  for  temporal  demultiplexing  of  the  output).  In  cases 
that  demand  higher  frame  rates,  the  array  can  be  segmented  so  that  multiple  readout 
buffers  can  be  used,  each  accessing  a  fraction  of  the  total  number  of  rows  in  the  array. 
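
The frame rate limit and the effect of segmentation can be estimated with a short calculation. The sketch below assumes that each readout buffer serves an equal share of the rows, that all buffers run at the same serial rate, and that overheads such as the lateral shift time are negligible; the function name and parameters are illustrative.

```python
def ccd_frame_rate(rows: int, cols: int, serial_rate_hz: float,
                   n_buffers: int = 1) -> float:
    """Upper bound on the frame rate of a CCD array with segmented readout.

    Assumptions: rows are divided evenly among the buffers, every buffer
    runs at serial_rate_hz, and transfer overheads are ignored.
    """
    if n_buffers < 1 or rows % n_buffers != 0:
        raise ValueError("rows must divide evenly among the readout buffers")
    pixels_per_buffer = (rows // n_buffers) * cols
    return serial_rate_hz / pixels_per_buffer


if __name__ == "__main__":
    single = ccd_frame_rate(2048, 2048, 200e6)
    quad = ccd_frame_rate(2048, 2048, 200e6, n_buffers=4)
    print(f"one buffer:   {single:.1f} frames/s")
    print(f"four buffers: {quad:.1f} frames/s")
```

For the 2048 x 2048 array and 200 MHz buffer quoted above, this simple ratio evaluates to a little under 50 frames per second, and it scales linearly with the number of buffers.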

An  unusual  feature  of  identifying  the  technological  constraints  that  bound  any  neural 
network  implementation,  whether  it  be  electronic,  photonic,  chemical,  mechanical,  or  all 
of  the  above,  is  the  fact  that  we  just  don't  know  enough  yet  about  the  operation  of  highly 
interconnected nonlinear systems of large dimension to fully assess the impact of a particular
constraint on the overall system operation. One example is the degree to which
nonuniformities in the neuron units themselves (e.g. in their sensitivity, contrast ratio, or
overall response function) or in the interconnection medium can be tolerated by an architecture
that is to a large extent self-organizing. A second example is the necessity within
a certain neural network paradigm of implementing precisely the right nonlinearity that
translates summed neuron inputs to neuron outputs (or, for that matter, the very existence
of nonlinearities in the recording and storage of weight updates, and in the readout
of  the  full  weighted  interconnection  pattern).  Some  of  these  types  of  questions  may  be 
amenable  to  simulation,  but  in  other  cases  we  may  have  to  await  the  results  from  actual 
implementations  to  refine  our  understanding  of  the  technological  requirements. 

THE  FUTURE  OF  NEURO-OPTICAL  COMPUTATION 

A  wide  variety  of  photonic  architectures  and  components  are  currently  under  intensive 
investigation  for  neural  network  applications.  A  thorough  discussion  of  these  alternative 
strategies  and  their  strengths  and  weaknesses  is  unfortunately  beyond  the  scope  of  this 
chapter. In this final section, we address the future prospects of neuro-optical computation
from the point of view of those critical issues that are common to all such proposed
implementation  strategies. 


The  Critical  Issues 


As  with  any  emerging  technological  breakthrough,  successful  advanced  development  of 
photonic  neural  networks  will  require  that  the  technology  prove  to  be  manufacturable,  in 
the  sense  that  it  is  amenable  to  mass  production  techniques  at  reasonable  cost;  flexible 
in design, in that the technological base provides significant degrees of freedom for architectural
and functional variations; and leveraged as much as possible by developments
in  related  technologies  that  can  offload  a  significant  fraction  of  the  development  time  and 
costs.  Implicit  in  these  three  key  features  is  the  issue  of  the  component  uniformity  that  can 
be  achieved  over  large  array  sizes,  and  the  related  issue  of  the  scalability  of  the  technology 
to  large  scale  systems  either  by  increases  in  the  basic  array  sizes  or  by  the  incorporation 
of  a  modular  design  from  the  outset. 

In  all  useful  computational  systems,  the  bottom  line  is  to  a  large  extent  determined 
by  the  maximum  amount  of  computational  throughput  capacity  that  can  be  squeezed 
into  the  smallest  system  volume  within  a  tolerable  energy  dissipation  constraint  or  power 
budget.  In  the  case  of  photonic  neural  networks  that  utilize  the  implementation  strategy 
outlined  in  a  previous  section  of  this  chapter,  the  two  most  important  factors  that  influence 
the  computational  throughput  capacity  are  the  storage  capacity  of  the  volume  holographic 
optical  element,  and  the  operational  bandwidth  of  the  neuron  units  (nonlinear  spatial  light 
modulators). 

The  storage  capacity  of  a  holographic  interconnection  medium  is  in  turn  determined 
by  the  maximum  number  of  independent  weighted  interconnections  that  can  be  recorded 
and  retrieved  per  unit  volume.  Although  we  discussed  this  issue  in  an  earlier  section  from 
a  theoretical  viewpoint  (and  in  yet  another  section  from  an  architectural  viewpoint),  we 
have not addressed herein the even more important question of the actual density of independent
interconnections achievable with photorefractive (or other photosensitive real time)
materials that contain a considerable number of scattering centers and exhibit diffraction
efficiencies that depend strongly on the spatial frequency of the recorded grating. Demonstration
of a high density of weighted interconnections with low interchannel crosstalk in a
real time holographic recording medium will provide a benchmark of achievement for photonic
implementation strategies, as well as a metric by which one can more appropriately
estimate eventual system performance. Demonstration of parallel weight updates at high
sensitivity (requiring tolerable optical source intensities) during a complete training cycle
is also necessary for a convincing proof of system feasibility.

The  operational  bandwidth  of  two-dimensional  spatial  light  modulators  that  can  be 
used  as  neuron  unit  arrays  will  prove  to  be  orders  of  magnitude  larger  than  the  bandwidth 
characteristic  of  biological  systems.  Feasibility  analyses,  as  well  as  preliminary  device 
characterization  studies,  indicate  no  fundamental  or  technological  barriers  to  operation  of 
individual neuron units at bandwidths exceeding 100 MHz [Asthana, 1990b]. Switching a
100 x 100 element neuron unit array at 100 MHz with a 50% duty cycle, however, generates
a watt of power dissipation for every 2 picojoules of switching energy required by an
individual neuron unit, including the detector, amplifier, control circuitry, and modulator.
Although  the  neuron  unit  arrays  in  most  neural  network  architectures  will  not  approach 
50%  duty  cycles  from  full  off  to  full  on  in  actual  operation,  this  still  gives  us  a  very  tight 
energy  dissipation  budget,  and  may  eventually  force  a  lowering  of  the  design  bandwidth. 
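
The power estimate above follows from a simple product, sketched below. We assume that each neuron unit dissipates its full switching energy once per clock period, scaled by the duty cycle; the function name and parameters are illustrative.

```python
def array_power_watts(n_units: int, switching_energy_j: float,
                      bandwidth_hz: float, duty_cycle: float = 0.5) -> float:
    """Estimate the total power dissipated by an array of neuron units.

    Assumption: every unit dissipates switching_energy_j per clock period,
    weighted by the fraction of periods in which it actually switches.
    """
    return n_units * switching_energy_j * bandwidth_hz * duty_cycle


if __name__ == "__main__":
    # 100 x 100 units, 2 pJ per switching event, 100 MHz, 50% duty cycle.
    power = array_power_watts(100 * 100, 2e-12, 100e6)
    print(f"array power: {power:.2f} W")
```

With these inputs the product reproduces the figure of one watt per 2 picojoules of switching energy, and it shows directly how lowering the design bandwidth or the duty cycle relaxes the dissipation budget.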

For photonic implementations of neural networks that use optical imaging and holographic
interconnection systems extensively to increase the computational throughput capacity,
an important question will continue to be the fraction of "unfilled" system volume
dedicated to wavefront and beam propagation. Miniaturization of many sophisticated optical
signal processing systems has been a focus of effort only recently, and can be expected
to produce significant system volume reductions through clever (as well as careful) optomechanical
engineering. For example, gradient index (GRIN) techniques have been used in
conjunction with photolithographic planar processing to produce regular two-dimensional
arrays of diffraction limited microlenses that can be incorporated in stacked planar optical
modules with greatly reduced unfilled system volume [Iga, 1984]. There are, however, several
inherent limits (both fundamental as well as technological) that provide lower bounds
on  the  physical  volume  required  to  implement  a  high  degree  of  interconnectivity  among 
planes  of  neuron  units. 

When all is said and done, it is certainly a fair question to ask whether the physical
volume of a neuro-optical computer module would be better off densely packed with silicon
chips that emulate the same functionality. All of the preliminary evidence gathered to
date suggests that the answer to this question shifts rather dramatically from an emphatic
"yes" in the limit of small numbers of neurons and required interconnections, to a more
tenuous "no" as the number of neurons and density of required interconnections continues
to increase. Perhaps the most interesting question for the future of neuro-optical computation
is the clear identification of this performance boundary.

The  Incorporation  of  Neural  Paradigms 

In  concluding  this  chapter  on  neuro-optical  computation,  we  assert  that  although  the 
majority  of  preliminary  demonstrations  of  photonic  neural  network  architectures  have 
seemingly  focused  on  associative  memories  in  general,  and  on  variations  of  the  Hopfield- 
Amari  [Hopfield,  1982;  Amari,  1972]  network  in  particular,  it  is  essential  that  photonic 
implementations  have  the  capacity  for  incorporation  of  a  wide  variety  of  neural  network 
architectures,  computational  algorithms,  and  learning  rules.  This  is  particularly  important 
in  view  of  the  early  stage  of  development  that  characterizes  our  current  understanding  of 
the  operational  performance  of  even  the  most  fashionable  neural  network  models,  when 
scaled  up  to  large  numbers  of  densely  interconnected  neuron  units  with  realistic  stochastic 
variations  in  individual  neuron  unit  performance. 

Furthermore, it is likely that useful neural networks incorporated in a systems framework
may require either a number of layers with different characteristics, or considerable
pre- and post-processing to achieve sophisticated functionality. One example of a neural
paradigm that requires such additional sophistication is the Dynamic Link Architecture of
von der Malsburg discussed in Chapter 11 [Buhmann, 1991] as applied to pattern recognition
problems by means of graph matching techniques. A photonic implementation of this
architecture will likely require several interacting modules for complete functionality.

Currently investigated photonic architectures and components for neural network implementation
do not yet enjoy the flexibility of full-fledged computer aided design and
computer aided manufacturing that is the hallmark of the silicon VLSI circuit industry.
On the other hand, tremendous strides have been made in just the past few years in the
simulation of even complex optical systems including the effects of both refractive and
diffractive components. The next step, from extensive simulation capabilities to design
automation, is under active investigation and may, when taken, herald the beginnings of a
viable photonic-based neural network implementation technology.

PROBLEMS

1. Consider a system of beamsplitters arranged to combine a set of N input beams to
form a single, collinear output beam with an intensity proportional to an equally
weighted sum of all of the inputs. Choose a particular architecture for the beamsplitter
arrangement, and justify it in terms of efficiency, simplicity, or minimization
of  component  count.  For  the  case  of  incoherent  illumination,  derive  the  optimal 
transmissivities  of  the  beamsplitters  in  your  arrangement,  and  prove  that  the  chosen 
architecture generates an input-output relationship in the form of Equation (2). Repeat
the analysis for the case of coherent illumination, and derive the equivalent of
Equation  (8)  that  characterizes  the  chosen  architecture. 

2. Two mutually coherent beams of intensities |a|^2 and |b|^2 are incident on a detector.
The coherent superposition of the beams is given by, in one dimension,

A(x) = a e^{j k_1 x} + b e^{j k_2 x}.

(a) Plot the resulting intensity, |A(x)|^2, as a function of x.

(b) Show that the integral of |A(x)|^2 over an integral number of its periods is equal
to |a|^2 + |b|^2, thus proving that the detector's response is equal to the incoherent
sum of the individual beams.

3. For the case of a thin phase grating with a sinusoidal index modulation given by
n(x) = n_0 + n_1 sin(k_G x) with n_1 < n_0, calculate the value of n_1 d that maximizes
the diffraction efficiency into the first diffracted order, and the maximum diffraction
efficiency achievable. Assume that the grating is read out by a semiconductor laser
with a wavelength of 850 nm. For a given incident intensity of the optical readout
beam, calculate the ratio of the intensity diffracted into the 0th, 2nd, and 3rd diffracted
orders to that diffracted into the 1st order.

4. Consider the process of diffraction from a thin amplitude grating with a spatially
varying transmissivity and negligible phase modulation. For a sinusoidal transmittance
modulation given by t(x) = t_0 + t_1 sin(k_G x) with t_1 < t_0, calculate the value
of t_1 that maximizes the diffraction efficiency into the first diffracted order, and the
maximum diffraction efficiency achievable. Assume that the grating is read out by
a semiconductor laser with a wavelength of 850 nm. For a given incident intensity
of the optical readout beam, calculate the ratio of the intensity diffracted into the
0th, 2nd, and 3rd diffracted orders to that diffracted into the 1st order. Discuss the
essential differences observed between the amplitude and phase grating cases.

5.  Consider  the  process  of  holographic  grating  recording  in  a  thick  holographic  recording 
medium.  Assume  that  the  grating  is  recorded  and  read  out  by  a  semiconductor  laser 
with  a  wavelength  of  850  nm,  and  that  the  angle  included  between  the  two  recording 
beams  is  30  degrees.  What  is  the  spatial  frequency  of  the  recorded  grating?  How 
thick  must  the  hologram  be  in  order  to  generate  a  grating  parameter  Q  (as  defined  in 
the  text)  of  1,000?  What  is  the  approximate  angular  width  of  this  recorded  grating 
(as measured, for example, by varying the angle of incidence of the readout beam)?

6. For the thick holographic grating described in Problem 5, calculate the amplitude
of the refractive index modulation that is necessary to achieve 100% diffraction efficiency
on readout. How large an absorption coefficient can be tolerated in the
holographic recording medium at the readout wavelength if the absorption loss in
diffraction efficiency is to be kept below 5%?

7. Consider the holographic interconnection scheme depicted in Figure 15.9. First,
derive the basic relationship for a lens that associates a given point p_1 in the input
plane with a resulting beam angle θ_1. Given a focal length of 5 centimeters for lenses
L_1 and L_2, what is the minimum spacing required between nearest neighbor points in
the input plane for a grating Q of 1,000? What Q will be required to accommodate
of order 10^4 input positions?

8.  Design  a  differentiating  circuit  for  incorporation  in  an  optically  addressed  spatial  light 
modulator,  using  the  principles  of  Chapters  14  and  15.  Assume  that  the  detector 
is  a  p-i-n  photodiode,  and  that  the  modulator  can  be  treated  as  a  purely  capacitive 
load.  If  the  modulator  can  be  modeled  as  a  parallel  plate  capacitor  of  dimensions 
30 x 50 μm, with a thickness of 1 μm and a relative dielectric constant typical of
gallium arsenide multiple quantum well devices (ε = 13), estimate the bandwidth
over  which  the  differentiator  is  operational. 

9. If storage of one synaptic weight in VLSI requires a memory element 15 μm x 15 μm
in size, what is the maximum number of synaptic weights that can be implemented
on a 1 cm x 1 cm chip? If a VLSI neural network is fully connected using one such
memory element for each synapse, how many pins are required for input to, and
output from, the set of neuron units? How many pins would be required for parallel
input of the weights? If an optical synaptic weight can be implemented in an effective
volume of 5 μm x 5 μm x 5 μm, what is the maximum number of synaptic weights
that can be implemented in a volume 1 cm x 1 cm x 1 cm?

10. (a) If a neural network is simulated on a digital, sequential machine, how many multiply
operations and add operations are required to simulate the computational
process of a single-layer feedforward network with N neuron units and a connectivity
(number of connections per neuron unit) of M? If a multiply operation
can be performed in 100 ns and an add operation in 25 ns, what is the minimum
time it will take for one pass through the network if N = 10^6 and M = 10^4?

(b) For the outer product learning of Equation (24), neglecting the decay term,
how many multiply operations and add operations are required for one iteration
of weight updates, in terms of M and N? For a two-layer network (1 hidden
layer), with each layer having N = 10^6 and M = 10^4, and assuming 10^4 different
patterns presented 1,000 times each, how long will the network take to be trained
(assuming 1 forward pass and 1 update of all weights per presentation, 100 ns
per multiply operation and 25 ns per add operation)?

(c) For the same numbers of (b), assuming each layer of a photonic system can
perform a forward pass in 100 ns and a set of parallel weight updates in 1 μs,
how long will it take to be trained?

11. (a) Design an algorithm that uses only nonnegative signals outside of each neuron
unit, nonnegative weights, and allows two separate inputs to, and two separate
outputs from, each neuron unit. It should be able to perform neural computation
and weight updates for learning. Your answer should be in the form of a flow
chart. You may perform subtraction and division only within each neuron unit;
only addition, multiplication, interconnection and storage can be performed
external to the neuron units. (No need to simulate.)

(b)  After  many  iterations  during  learning,  might  there  be  a  problem  with  weights 
saturating or going out of bounds? If not, why not? If so, conjecture as to how
this  problem  might  be  avoided. 

12. In regard to Equation (24), find an expression for δ_i for the following algorithms:

(a)  Perceptron 

(b) Widrow-Hoff

(c)  Least  minimum  square  (back  propagation),  for  a  multi-layer  net  for 

(i)  output  layer 

(ii)  hidden  layers 

Assume that β = 0 for this problem.

(Note:  This  problem  requires  familiarity  with  neural  networks  not  discussed  in  this 
chapter.)

13. Referring to the system of Figure 15.21, if the interconnection mask SLM were electronically
addressed with a serial line, capable of transmitting analog values at 50
MHz, and there are 10^6 pixels (analog weights), what is the maximum frame rate?
If there are 10^3 parallel lines addressing? If the SLM is optically addressed, what
limits  the  frame  rate? 

14. (a) Referring to Figure 15.23, show how the following network can be drawn as a
single layer network with feedback.

<<  Insert  Figure  for  Problem  14(a)  >> 

(b)  How  can  the  architecture  shown  in  Figure  15.26  be  modified  to  include  feedback 
connections  within  the  module?  Sketch  the  resulting  architecture. 

15.  Consider  a  charge-coupled-device  array  as  shown  schematically  in  Figure  15.34,  of 
dimension  1000  x  1000  pixels  with  a  serial  readout  buffer  that  operates  at  a  clock 
frequency  of  100  MHz.  Calculate  the  maximum  frame  rate  achievable,  and  the  speed 
required  of  the  row  shift  registers.  Calculate  the  ratio  between  the  total  storage  time 
required  of  the  first  pixel  read  out  to  that  of  the  last  in  each  frame. 


FIGURE  CAPTIONS 


Fig. 15.1 Illustration of optical addition utilizing a 50/50 beamsplitter: (a) collinear
incoherent beam geometry; (b) collinear coherent beam geometry, showing input and
output amplitudes; (c) collinear coherent beam geometry, showing input and output
intensities.

Fig. 15.2 Illustration of optical addition utilizing mirrors: (a) angularly multiplexed incoherent
beam geometry; (b) angularly multiplexed coherent beam geometry.

Fig. 15.3 Illustration of optical multiplication utilizing a medium with variable transparency.


Fig.  15.4  Fundamental  principles  of  spatial  light  modulator  function:  (a)  block  diagram 
of  the  principal  functions  of  an  optically-addressed  spatial  light  modulator,  including 
the  detection,  functional  implementation,  and  modulation  functions;  (b)  schematic 
diagram  of  an  N  x  N  array  of  spatial  light  modulator  pixels,  in  which  three  pixels 
are shown in different transmission states; (c) expanded view of the pixel array,
showing an incomplete fill factor within each pixel; (d) expanded view of a single
pixel within the array, illustrating one possible pixel configuration that incorporates
two detector elements D_1 and D_2, control electronics for impedance matching and
functional  implementation,  and  two  modulator  elements,  shown  here  in  different 
transmittance  states. 

Fig.  15.5  Examples  of  monolithically-integrated  spatial  light  modulators.  The  chosen 
examples  incorporate  photodetectors,  control  circuitry,  and  multiple  quantum  well 
modulators within each pixel on a single gallium arsenide (GaAs) substrate. In
(a),  the  control  electronics  and  photodetector  elements  are  fabricated  following  the 
photolithographic  definition  and  physical  isolation  of  the  modulator  elements,  while 
in  (b)  a  buffer  (isolation)  layer  is  used  to  allow  fabrication  and  interconnection  of  all 
of  the  elements  without  chemical  or  ion  beam  etching. 

Fig.  15.6  Example  of  a  hybrid  spatial  light  modulator,  in  which  the  photodetectors  and 
control  electronics  are  fabricated  on  a  silicon  substrate,  and  the  multiple  quantum 
well modulator elements are fabricated on a gallium arsenide (GaAs) substrate. The
two sets of devices are bump contacted on a pixel-by-pixel basis to provide parallel
electrical  continuity. 

Fig.  15.7  VLSI  layout  of  a  generalizable  silicon-based  spatial  light  modulator  structure: 
(a) neuron pixel layout; (b) photograph of a single neuron unit in VLSI implementation,
with probe pads substituted for the two detectors (bottom) and for contact to
the  two  modulation  elements  (top);  (c)  photograph  of  a  6  x  6  array  of  neuron  units 
on  a  VLSI  chip  that  incorporates  additional  test  circuitry. 

Fig. 15.8 A simplified holographic recording configuration: case of plane wave signal and
reference beams, and a thin holographic recording medium; (a) recording, and (b)
reconstruction with a plane wave readout beam.

Fig. 15.9 A point-to-point interconnection system, using a holographic optical element
(HOE) for interconnection routing, and lenses as position-to-angle and angle-to-position
encoders. In this example, the holographic optical element effectively performs
an input angle to output angle transformation, such that light emitted (or
transmitted) at point p1 in the input plane (P1) is detected at point p2 in the output
plane (P2).

Fig.  15.10  Volume  holographic  recording  with  plane  wave  signal  and  reference  beams;  (a) 
recording,  and  (b)  reconstruction,  showing  the  elimination  of  the  higher  diffracted 
orders. 

Fig.  15.11  The  angular  alignment  sensitivity  of  a  volume  holographic  optical  element, 
as  a  function  of  the  dimensionless  Q-parameter  defined  in  the  text.  The  grating 
strength  for  all  of  the  curves  (3.14  radians)  is  optimized  to  produce  100%  diffraction 
efficiency  in  the  limit  of  large  Q  (Bragg  diffraction  regime),  and  is  not  optimized 
for  low  Q  gratings.  Note  that  the  diffraction  efficiency  is  essentially  independent  of 
angle  for  low  Q  gratings,  and  is  very  strongly  peaked  at  the  Bragg  angle  (7.5  degrees 
in  this  case)  for  high  Q  gratings. 

Fig.  15.12  The  diffraction  efficiency  of  thin  (Raman-Nath  diffraction  regime)  and  thick 
(Bragg  diffraction  regime)  holographic  gratings  as  a  function  of  the  grating  strength. 

Fig. 15.13 Schematic representation of a 4 input, 4 output holographic interconnection,
showing 4 coherent input beams x_1-x_4 and 4 coherent recording beams y_1-y_4, each
of which corresponds to a desired output y'_1-y'_4. In (a), the sets {x_i} and {y_i} interfere
within the volume holographic medium, recording the desired interconnection
diffraction gratings. In (b), a new set of input beams {x_j} illuminates the volume
holographic medium, reading out the weighted interconnection pattern and forming
appropriately weighted sums at each of the outputs {y'_i}.

Fig. 15.14 Schematic representation of the fan-out process for optical beams, for the case
of one input and three outputs: (a) with beamsplitters (BS1 - BS3); (b) with a single
holographic  optical  element  containing  three  multiplexed  (spatially  superimposed) 
diffraction  gratings. 

Fig.  15.15  Schematic  representation  of  the  fan-in  process  for  optical  beams,  for  the  case 
of  three  angularly  distinct  inputs  and  one  combined  collinear  output  beam:  (a)  with 
beamsplitters,  showing  the  unavoidability  of  a  throughput  loss  associated  with  the  set 
of  transmitted  (and  multiply  reflected)  beams;  (b)  with  a  single  holographic  optical 
element  containing  three  multiplexed  (spatially  superimposed)  diffraction  gratings, 
showing  an  analogous  throughput  loss. 

Fig. 15.16 Illustration of the generation of crosstalk in holographic optical interconnections
due to beam degeneracy: recording/readout configuration. The input beams
{x_j} are assumed to have interfered within the volume holographic medium with the
set of recording beams {y_i}, producing the desired set of interconnection gratings
with weights w_ij. Illumination of the volume holographic medium with beam x_1 produces
a 1 to 4 fanout into the output beams {y'_i}, as well as the zeroth order beam
x'_1. Due to the effects of beam degeneracy, power is also coupled into the zeroth
order beams x'_2-x'_4, and crosstalk terms {c_i} are introduced into the outputs.

Fig. 15.17 Illustration of the generation of crosstalk in holographic optical interconnections
due to beam degeneracy: diffraction efficiency as a function of grating strength
for the readout configuration of Figure 15.16. Shown are the depletion of the zero
order beam x'_1 and the rise of the desired output beams y'_i, accompanied by a strong
buildup of the cross-coupled beams x'_2-x'_4.

Fig. 15.18 Illustration of a surface-emitting laser diode source array [after Jewell, 1990].
In this example, the individual semiconductor laser diodes are isolated by chemically
assisted ion beam etching techniques, must be individually contacted, and emit
through  the  GaAs  substrate. 


Fig. 15.19 Schematic diagram of a photodarlington pair utilized as a high gain
detector/amplifier combination.

Fig.  15.20  Schematic  diagram  of  a  charge  coupled  device  (CCD)  photodetector  array 
fabricated on a silicon substrate. Electrostatic potential wells are created by application
of appropriate voltages to the three phase bias electrode structure, with electrical
isolation provided by the gate oxide layer. Light incident through the transparent
electrodes creates stored charge that can be transferred to an output signal terminal
by proper sequential phasing of the bias voltages (P1 - P3).

Fig. 15.21 Block diagram of a 1-D to 1-D photonic neural network, in which a one-dimensional
neuron array is fully interconnected to a one-dimensional detector array
by  means  of  a  two-dimensional  interconnection  mask. 

Fig.  15.22  Block  diagram  of  a  2-D  to  2-D  photonic  neural  network,  in  which  a  two- 
dimensional  neuron  array  is  fully  interconnected  to  a  two-dimensional  output  array 
by  means  of  a  three-dimensional  volume  holographic  optical  interconnection  mask. 
The  input  plane,  output  plane,  and  optional  training  plane  are  shown.  Many  variants 
of  this  geometry  with  similar  properties  are  possible. 

Fig.  15.23  A  single  layer  physical  neural  network  with  feedback,  used  to  implement  a 
multilayer recurrent functional network. The solid boxes indicate feedforward connections,
and the broken boxes indicate lateral connections.

Fig. 15.24 Incoherent/coherent technique for recording and reconstructing multiple holograms
simultaneously, in which all solid lines represent mutually coherent beams, and
all broken lines represent a separate set of mutually coherent beams: (a) recording;
(b) reconstruction; and (c) holographic representation, in which each hologram represents
the fanout from a given neuron unit.

Fig. 15.25 Doubly angularly multiplexed volume holographic optical interconnection, designed
to circumvent the effects of beam degeneracy. The mutually incoherent input
beams ({x_j}) are angularly multiplexed over j, as are the corresponding sets of output
beams from the training plane ({φ_ij}) generated by the coherent sources S_j, to
produce an angularly multiplexed fan-in at each summed output, thus yielding the
neuron activation potentials {p_i}.

Fig.  15.26  Photonic  architecture  for  neural  network  implementation  that  incorporates  a 
parallel  source  array,  double  angular  multiplexing,  and  incoherent/coherent  recording 
and  reconstruction:  the  Hebbian  case  is  depicted. 

Fig. 15.27 Photonic architecture for neural network implementation: recording configuration.
This configuration implements the learning function in the photonic architecture
of Figure 15.26. The sets of beams emitted from the source array (two are
shown)  interfere  in  the  volume  holographic  medium  to  update  the  weights  stored  in 
the  interconnection  holograms. 

Fig. 15.28 Photonic architecture for neural network implementation: reconstruction configuration.
This configuration implements a single forward pass of the computing
function in the photonic architecture of Figure 15.26. The lower set of beams acts
as a set of reference beams, and generates a set of weighted output arrays that are
imaged onto the detector array. Each stored hologram is reconstructed by a single
neuron unit x_j, and fans out with appropriate weights to illuminate the detector
array. The full set of reconstructed holograms sums within each pixel to yield the
neuron activation potentials {p_i}.

Fig. 15.29 Generalized photonic architecture for neural network implementation, including
provision for the generation of arbitrary training terms (δ_i).

Fig.  15.30  Schematic  diagram  of  a  dual-input,  dual-output  differential  amplifier  that 
effects  a  sigmoid-like  transfer  characteristic. 

Fig. 15.31 Experimentally obtained transfer characteristics from the circuit shown in Figure
15.30, showing the output voltage in both channels (V_out1 and V_out2) as a function
of one input voltage (V_in1), with the other input voltage (V_in2) as a parameter.

Fig. 15.32 Optical layout for copying the entire contents of a three-dimensional volume
holographic optical element (VHOE) into a second VHOE, utilizing a two-dimensional
array of individually coherent, but mutually incoherent sources.

Fig. 15.33 The single pixel probability of error P(Err) as a function of the number of
photons  detected  within  each  pixel,  for  the  cases  of  100  and  1000  analog  grey  levels. 

Fig.  15.34  Illustration  of  parallel-to-serial  conversion  in  two-dimensional  detector  arrays 
such  as  the  charge-coupled-device  (CCD)  array.  Charge  accumulated  within  each 
photosensitive  region  during  exposure  is  transferred  laterally  by  a  set  of  row-parallel 
shift  registers  to  a  high  speed  parallel-to-serial  shift  register,  which  reads  out  the 
entire  array  one  column  at  a  time. 

Numerous applications have been envisioned for spatial light modulators in optical
information processing and computing systems [1, 2]. These applications can be summarized
within the context of a generalized optical information processor or computer as
shown schematically in Fig. 1 [2]. The principal functional roles of both one- and two-dimensional
spatial light modulators include those of format, input, output, CPU, and
memory devices. In addition, spatial light modulators can be effectively utilized to provide
certain types of feedback interconnections, as for example in the case of optical crossbar
switches and holographically encoded weighted interconnections.

Such a wide variety of applications has of course led to an equally wide variety of interrelated,
and at times conflicting, device requirements. In this presentation, we examine
these requirements from three complementary perspectives: fundamental physical limitations
that affect the performance of any spatial light modulation function; the current
status of spatial light modulator development with respect to such fundamental limits;
and technological considerations that impact present and future device design and development.
Each of these three perspectives will be discussed in detail, and is outlined briefly
below.

Study  of  the  fundamental  physical  limitations  that  affect  an  emerging  technology  is  at 
once  an  exciting  and  somewhat  sobering  endeavor.  The  excitement  arises  naturally  from 
the discovery of what we can in fact achieve; the sobering impact often occurs with the realization
of what we have in fact achieved. Numerous such fundamental physical limitations
pertain to the process of spatial light modulation. For example, two principal attributes
of incident wavefronts can be conveniently modulated: amplitude and/or phase. In most
applications, it is desirable to modulate one or the other, but not both. Yet these two
parameters are intimately related, such that modulation of one has a deterministic impact
on the other through the Kramers-Kronig relations. Analysis of this interrelationship can
yield  fundamental  limits  on  the  phase/amplitude  cross-talk  anticipated  for  various  physical 
device  configurations,  as  well  as  appropriate  figures  of  merit.  Furthermore,  such  analysis 
reveals  the  potential  for  “dispersion  engineering”,  in  which  one  can  imagine  utilizing  a 
technology  such  as  compound  semiconductor  multiple  quantum  wells  to  tailor  dispersion 
and/or  absorption  curves  to  match  device  requirements.  One  such  example  is  illustrated  in 
Fig.  2,  which  shows  the  related  absorption  and  dispersion  curves  for  two  5  meV  linewidth 
Lorentzian  oscillators  displaced  by  15  meV.  As  can  be  seen,  a  region  of  nearly  linear  index 
variation  with  energy  can  be  generated  at  a  local  minimum  in  the  absorption  profile. 
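
The two-oscillator example can be sketched numerically. The following is a minimal illustration, assuming the simple resonant Lorentzian line shape (a Kramers-Kronig pair in the near-resonance approximation) with unit peak height; the function names and the normalization are assumptions made here for illustration, and the sketch is not the calculation used to produce Fig. 2.

```python
# Two Lorentzian oscillators, 5 meV FWHM each, displaced by 15 meV.
# Energies are measured in meV relative to the midpoint between the two lines.
# Normalization (unit peak absorption per line) is an assumption for illustration.

CENTERS_MEV = (-7.5, 7.5)   # line centers, 15 meV apart
FWHM_MEV = 5.0              # full width at half maximum of each line


def lorentz_absorption(energy_mev, center_mev, fwhm_mev):
    """Absorptive (imaginary) part of a single resonant Lorentzian, peak value 1."""
    half = fwhm_mev / 2.0
    detuning = center_mev - energy_mev
    return half * half / (detuning * detuning + half * half)


def lorentz_dispersion(energy_mev, center_mev, fwhm_mev):
    """Dispersive (real) part paired with lorentz_absorption."""
    half = fwhm_mev / 2.0
    detuning = center_mev - energy_mev
    return half * detuning / (detuning * detuning + half * half)


def absorption(energy_mev):
    return sum(lorentz_absorption(energy_mev, c, FWHM_MEV) for c in CENTERS_MEV)


def dispersion(energy_mev):
    return sum(lorentz_dispersion(energy_mev, c, FWHM_MEV) for c in CENTERS_MEV)


if __name__ == "__main__":
    print(" E (meV)  absorption  index change")
    for tenth in range(-30, 31, 10):
        e = tenth / 10.0
        print(f"{e:8.1f}  {absorption(e):10.4f}  {dispersion(e):12.4f}")
```

Within a few meV of the midpoint, the tabulated index change varies almost linearly with energy while the absorption passes through its local minimum, which is the behavior described above.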

A  second  important  fundamental  limitation  pertains  to  the  minimum  (quantum  limited) 
energy  required  per  unit  resolution  element  to  achieve  a  given  level  of  modulation  within 
predetermined  accuracy  constraints.  Different  limits  can  be  derived  for  both  analog  and 
digital  spatial  light  modulation  [2,  3,  4,  5],  as  well  as  for  the  important  cases  of  optically 
addressed  and  electrically  addressed  devices.  For  purposes  of  discussion,  consider  the 
case  of  optically  addressed  spatial  light  modulators,  for  which  the  modulation  function 
inherently  involves  a  detection  process  within  each  resolution  element.  The  quantum  limits 
for  binary  switching  are  well  known,  and  are  summarized  for  optical  devices  in  Fig.  3  (after 
P.  W.  Smith,  Ref.  [6]).  For  analog  modulation,  quantum  restrictions  place  much  stricter 
boundary  limitations  on  the  minimum  allowable  photon  flux  for  a  given  pixel  resolution, 
error rate, and device framing rate [2, 3, 4, 5].

By comparison, analog representations (as used extensively, for example, in optical
processors) require far more energy than the binary equivalent. This is due to the necessity
of utilizing a much higher particle count (electrons or photons) in order to minimize
the effects of quantum statistical fluctuations on the bit error rate (BER). For example,
if we wish an analog representation of the number 1,000, then we require a dynamic
range of at least 1,000:1. For incoherent illumination, quantum fluctuations in the emission/detection
process produce a photon number distribution with a relative standard
deviation of σ = √N/N. The equivalent of a 10⁻⁹ BER for the digital case corresponds
to roughly 12 standard deviations. Therefore, the number of photons required must be
greater than 1.5 × 10⁸ from statistical considerations alone, as illustrated in Fig. 4. For a
GaAs semiconductor laser characterized by a photon energy of ~1.5 eV, this corresponds
to about 10¹⁰ k_BT. To represent 1,000 optically in binary requires approximately 14 bits
(10 bits for the number plus 4 bits of overhead) at 15 eV each (10 photons at 1.5 eV each,
assuming direct detection and an ideal detector), or about 10⁴ k_BT.
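
The arithmetic behind these estimates can be reproduced with a short sketch. It assumes that adjacent analog levels (spaced N/1000 apart at full scale) must be separated by at least 12 standard deviations of √N photons, and that k_BT is evaluated at 300 K; the temperature and the function names are assumptions introduced here, not values stated in the text.

```python
# Order-of-magnitude check of the analog vs. binary energy comparison.

K_B_EV_PER_K = 8.617333e-5   # Boltzmann constant in eV/K
TEMPERATURE_K = 300.0        # assumed room temperature (not stated in the text)
PHOTON_EV = 1.5              # GaAs laser photon energy quoted in the text


def min_photons(levels, n_sigma):
    """Smallest full-scale photon count N with N/levels >= n_sigma * sqrt(N)."""
    return (n_sigma * levels) ** 2


def energy_in_kt(energy_ev):
    """Express an energy given in eV in units of k_B*T at TEMPERATURE_K."""
    return energy_ev / (K_B_EV_PER_K * TEMPERATURE_K)


# Analog: 1000 distinguishable levels, 12 standard deviations between levels.
analog_photons = min_photons(levels=1000, n_sigma=12)
analog_kt = energy_in_kt(analog_photons * PHOTON_EV)

# Binary: 14 bits, 10 photons per bit.
binary_ev = 14 * 10 * PHOTON_EV
binary_kt = energy_in_kt(binary_ev)

if __name__ == "__main__":
    print(f"analog photons : {analog_photons:.3g}")
    print(f"analog energy  : {analog_kt:.3g} kT")
    print(f"binary energy  : {binary_ev:.0f} eV = {binary_kt:.3g} kT")
    print(f"analog/binary  : {analog_kt / binary_kt:.3g}")
```

Under these assumptions the analog photon count comes out at the same order as the 1.5 × 10⁸ quoted above, and the two representations differ by roughly six orders of magnitude in energy.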

For an optically addressed spatial light modulator operating over a dynamic range of
1000:1 with 1000 × 1000 elements in an active area of 1 cm², and at a frame rate of
1 MHz, the detected input flux must exceed 9 W/cm² in order to achieve a 10⁻⁹ BER
in  each  pixel.  Considerations  such  as  these  have  significant  implications  for  the  SLM 
design,  particularly  with  regard  to  sensitivity  at  a  given  dynamic  range.  These  implications 
extend  to  the  system  design  as  well,  in  which  case  it  is  important  to  distinguish  between 
single  and  multiple  iteration  algorithms  (with  different  BER  requirements  at  each  stage). 
Furthermore,  in  many  applications  the  algorithm  is  really  a  form  of  contraction  mapping, 
in  which  the  desired  output  space  may  be  a  very  small  subset  of  the  transformed  input 
space  (e.g.  determining  whether  or  not  a  given  correlation  peak  exceeds  a  predetermined 
threshold).  In  such  cases,  significantly  relaxed  demands  may  be  appropriate  for  the  BER 
of  individual  SLM  pixels. 

For  envisioned  systems  applications  involving  one  or  more  spatial  light  modulators, 
analysis of fundamental limitations such as that described above is essential for determining
whether an analog or digital approach is favorable from the point of view of a fixed
input  power  budget  at  a  given  desired  computational  throughput  rate.  In  general,  a  given 
processing  or  computation  function  can  be  partitioned  into  the  cost  (energy  or  otherwise) 
of  representation,  the  cost  of  computation,  and  the  cost  of  detection  and  utilization  of 
the  answer.  For  operation  at  the  quantum  limits,  analog  representations  are  favored  for 
architectures  and  algorithms  that  implement  a  high  degree  of  computational  complexity 
(irreducible  number  of  equivalent  binary  operations)  per  unit  detected  output  resolution 
element,  whereas  binary  representations  favor  operations  with  a  somewhat  lower  degree  of 
computational  complexity. 

The current status of spatial light modulator development depends strongly for its assessment
on the nature of the application and its resultant requirements. It is interesting
to note at the outset that for certain applications (such as incoherent-to-coherent conversion
with high analog accuracy), spatial light modulator technologies have been developed
which approach quantum limited performance. On the other hand, a broad spectrum of
applications exists for which current spatial light modulators fall far short of such ultimate
performance boundaries. In making such comparisons, it is of critical importance to identify
interrelated sets of performance parameters that cannot be arbitrarily separated, and
to assess the conjoint figure of merit achievable within a given device technology (rather
than  the  minimum  value  obtained  across  many  different  types  of  devices,  or  even  within 
the  same  device  under  different  operating  conditions). 

A  large  number  of  technological  considerations  apply  to  the  eventual  incorporation 
of  spatial  light  modulators  in  optical  information  processing  and  computing  systems.  For 
systems  of  given  size,  spatial  frequencies  are  inherently  limited  by  the  acceptance  apertures 
of  finite  F-number  lenses.  Phase  modulation,  which  is  capable  of  much  larger  diffraction 
efficiencies at a given spatial frequency than amplitude modulation, is more difficult to
implement in imaging configurations and is more sensitive to substrate nonuniformities
and  polish  figure.  Amplitude  modulation,  on  the  other  hand,  can  present  formidable 
thermal  dissipation  problems  for  applications  involving  spatial  light  modulation  with  high 
optical  throughput  gain.  Applications  exist  for  both  optically  addressed  and  electrically 
addressed  spatial  light  modulators,  with  distinct  requirements  for  each.  In  fact,  several 
recently  conceived  applications  such  as  the  utilization  of  spatial  light  modulators  in  neural 
network  implementations  could  advantageously  employ  both  address  modes  simultaneously. 

Electrically addressed spatial light modulators lead quite naturally to inherently pixelated
structures. Such structures can be advantageous from several points of view, including
the convenient merging of optical and electrical inputs, interpixel cross-talk, strictly limited
space-bandwidth product, and fixed system registration. However, pixelation can yield
additional  difficulties  such  as  inherently  incomplete  fill  factors,  scattering  from  metallic 
interconnections,  and  fixed  pattern  noise.  Similar  types  of  considerations  apply  to  the 
utilization  of  reflective  as  opposed  to  transmissive  device  geometries. 

A  final  technological  consideration  that  will  continue  to  strongly  affect  device  design 
and  development  is  that  of  available  optical  sources,  both  CW  and  pulsed.  For  fixed  (and 
usually  limited)  external  power,  size,  and  weight  considerations,  the  maximum  achievable 
pulsed  energy  densities  available  set  stringent  requirements  on  the  magnitudes  of  usable 
higher  order  material  nonlinearities.  Recent  achievements  of  enhanced  nonlinearities  in 
both  bulk  and  multiple  quantum  well  compound  semiconductor  structures,  as  well  as  in 
nonlinear organic polymers, lend importance to the question of whether or not χ(3) materials
as well as χ(2) materials will provide useful spatial light modulation functions in future
devices. 

Fig. 1 Schematic diagram of the principal elements of a generalized optical processor or
computer.

Fig. 2 Absorption and dispersion of a two oscillator system, with 5 meV linewidths
displaced by 15 meV.

Fig. 3 Power dissipation per bit as a function of switching time, showing several fundamental
and technological limitations (after Ref. 6).

Fig. 4 The probability of error P(ERR) as a function of the number of photons detected
in a given pixel of an optically addressed spatial light modulator, for both 100:1 and 1000:1
dynamic range.

Summary


Numerous  applications  have  been  envisioned  for  spatial  light  modulators  in  optical 
information  processing  and  computing  systems.  The  principal  functional  roles  of  both  one-  and 
two-dimensional  spatial  light  modulators  include  those  of  format,  input,  output,  CPU,  and 
memory  devices.  Such  a  wide  variety  of  applications  has  of  course  led  to  an  equally  wide  variety 
of interrelated, and at times conflicting, device requirements that arise from fundamental physical
limitations  as  well  as  current  technological  drawbacks.  In  this  investigation  we  have  examined 
these  requirements  for  a  particular  class  of  modulators,  the  operation  of  which  is  based  on  an 
asymmetric  Fabry-Perot  etalon  with  a  multiple  quantum  well  (MQW)  structure  as  the  cavity 
medium. 

The use of such devices as reflection modulators based on AlGaAs/GaAs MQW structures
has been investigated by a number of authors.¹ One of the primary advantages of such
structures  is  the  considerable  increase  in  both  dynamic  range  of  modulation  and  contrast  ratio 
achievable  at  a  given  applied  field  strength  in  comparison  with  previously  investigated  transmissive 
configurations,  in  addition  to  the  elimination  of  the  necessity  for  counteretching  of  the  (absorptive) 
GaAs  substrate.  On  the  other  hand,  the  dynamic  range  achieved  in  any  reflection  configuration 
must be assessed in conjunction with the 6 dB insertion loss characteristic of the intensity (rather
than  polarization)  beamsplitter  required  to  implement  normal  incidence  readout.  This  factor  alone 
rescales  an  80%  change  in  reflectivity  to  less  than  20%  when  considered  from  the  perspective  of 
system  rather  than  device  throughput.  Furthermore,  such  a  reflection  configuration  presents  the 
additional  systems  complexity  of  source  isolation  within  each  stage  of  a  (multistage)  processor. 
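
The 6 dB figure and the rescaling of the reflectivity change follow from a double pass through an intensity beamsplitter. A minimal sketch, assuming an ideal lossless 50/50 splitter traversed once in transmission and once in reflection (the function names are illustrative only):

```python
import math


def double_pass_throughput(transmit_fraction=0.5):
    """Throughput of a lossless intensity beamsplitter used once in transmission
    (toward the modulator) and once in reflection (toward the output)."""
    return transmit_fraction * (1.0 - transmit_fraction)


def loss_db(throughput):
    """Insertion loss in dB for a given power throughput fraction."""
    return -10.0 * math.log10(throughput)


def system_level_change(device_change, throughput):
    """Rescale a device-level reflectivity change to the system level."""
    return device_change * throughput


if __name__ == "__main__":
    t = double_pass_throughput()
    print(f"double-pass throughput : {t:.2f}")
    print(f"insertion loss         : {loss_db(t):.2f} dB")
    print(f"80% device change      : {100 * system_level_change(0.80, t):.0f}% at system level")
```

An ideal splitter gives a throughput of one quarter, so an 80% reflectivity change corresponds to at most 20% of the incident power; any excess loss in a real splitter reduces this further.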

One  possible  means  of  avoiding  the  6  dB  insertion  loss  is  to  employ  a  polarizing 
beamsplitter in conjunction with a quarter wave plate, thereby ensuring a 90° rotation of the
incident  polarization  on  reflection.  This  configuration  adds  system  complexity  and  cost,  as  well  as 
additional  component  requirements  such  as  the  extinction  ratio  of  the  polarizing  beamsplitter,  and 
the  off-axis  uniformity  of  the  net  phase  retardation  induced  by  the  quarter  wave  plate.  Such 
requirements  must  be  set  so  as  to  not  compromise  the  contrast  ratio  designed  into  the  reflective 
spatial  light  modulator.  Alternatively,  one  can  employ  MQW  structures  specifically  designed  to  be 
operated at other than normal incidence, with the concomitant complications implied by the
associated anamorphic optics, lateral rescaling, depth of field, and placement of critical and
conjunctive components with appropriately low f-numbers.

In  order  to  explore  asymmetric  cavity  MQW  structures  that  are  capable  of  transmissive 
mode  operation  without  resort  to  the  process  difficulty  and  sample  fragility  implied  by 
counteretching,  we  have  examined  the  prospects  of  utilizing  electric-field-addressed  multiple 
quantum  wells  fabricated  in  the  InGaAs/GaAs  system  on  GaAs  substrates.  In  this  investigation, 
we  have  taken  the  perspective  that  identification  of  fundamental  physical  limitations  to  device 
performance  is  exceedingly  useful  as  a  mechanism  for  the  delineation  (and  potential  circumvention) 
of  limitations  that  arise  from  purely  technological  choices.  These  considerations  are  of  particular 
importance  in  the  InGaAs/GaAs  system  due  to  the  fact  that  the  considerable  lattice  mismatch 
characteristic  of  useful  indium  compositions  results  in  a  strained  layer  growth  environment  with 
concomitant  implications  on  growth  morphology  and  the  necessary  incorporation  of  strain  relief 
mechanisms (such as growth on patterned substrates⁴). In this paper, we describe several
important  design  criteria  that  have  evolved  from  this  study,  and  apply  them  to  two  specific  cases: 
(a)  a  transmission  modulator  with  optimized  dynamic  range  and  contrast  ratio,  and  (b)  a  reflection 
configuration  in  which  the  readout  illumination  is  incident  through  the  (antireflection  coated) 
substrate.  This  latter  configuration  yields  significantly  improved  performance  characteristics  as 
well  as  novel  operational  features  at  a  considerable  reduction  in  device  fabrication  complexity. 

An  important  fundamental  physical  limitation  pertains  to  the  degree  of  spatial  light 
modulation  that  can  be  achieved  per  unit  change  in  a  given  modulation  parameter.  There  are  two 
characteristics  of  wavefronts  that  can  be  conveniently  modulated:  amplitude  and  phase.  In  many 
optical  information  processing  applications  it  is  desirable  to  modulate  one  or  the  other,  but  not 
both.  As  an  example  of  a  "pure"  amplitude  modulator  we  have  utilized  a  Lorentzian  oscillator 
profile  to  model  a  typical  heavy  hole  exciton  (neglecting  the  band  edge  absorption)  for  an 
InGaAs/GaAs MQW structure with a peak absorption of 10,000 cm⁻¹ and a linewidth (full width at
half maximum) of 7.5 meV (derived from experimental data on a 27 period MQW structure with 35
monolayers of InGaAs (15% In) and 71 monolayers of GaAs grown by molecular beam epitaxy
(MBE)), as shown in Fig. 1. Similarly, the response of the absorption spectrum to an applied bias
due to the quantum confined Stark effect (QCSE) for a typical applied field of about 100 kV/cm
and for a total device thickness of approximately 1.2 µm (100 Å wells / 200 Å barriers) is also
shown (dashed) in Fig. 1. Again, experimental data were used to quantify the decrease in
oscillator strength as well as the exciton broadening effects that are typical for such systems. This
model illustrates the basis of operation of a normally-on amplitude modulator that utilizes
electroabsorption at the chosen wavelength (0.985 µm) to generate the off-state, resulting in a
maximum achievable change in absorption of 5500 cm⁻¹. In addition to this constraint, other
parameters that determine optimum performance include the device thickness and the mirror
reflectivities, as well as the width and location of the Fabry-Perot resonance modes (which are also
dependent on electrorefractive-induced perturbations in shape and position). Related to the device
thickness is a critical cavity phasing condition, which can be set to an odd or even multiple of π,
thus giving rise to two fundamentally different operational modes.

For  the  transmission  modulator  configuration,  application  of  the  oscillator  model  with  its 
corresponding  index  dispersion  data  as  shown  in  Figs.  1  and  2,  in  conjunction  with  an  optimally 
designed  asymmetric  cavity,  yields  a  combination  of  fundamental  and  technological  considerations 
that in turn limit the dynamic range to 50% with a contrast ratio of 10:1, as shown in Fig. 3. In
this case, the mirror reflectivities are both equal to 85%. It should be noted that the signal
dependent phase modulation is limited in this case to approximately 0.03π. The fundamental
reason for the relatively poor contrast ratio is the necessity in the transmission configuration to
phase  the  cavity  such  that  in  the  on  (low  absorption)  state,  the  first  and  second  order  transmitted 
components  add  in  phase.  Thus,  extinction  in  the  off  state  relies  on  the  development  of  enough 
voltage-induced absorption change at the operating wavelength to completely eliminate the first
order beam on a single pass through the modulator. The peak absorption coefficient characteristic
of the InGaAs/GaAs system at this composition is insufficient to produce more than a 10:1
change  with  a  reasonably  wide  dynamic  range.  It  should  be  noted  in  Fig.  3  that  the  shorter 
wavelength  results  are  somewhat  artificial  due  to  neglect  of  the  parabolic,  background  and  residual 
absorption  terms. 

On  the  other  hand,  by  utilizing  a  low  reflectivity  (25%)  incident  mirror  in  conjunction  with 
a  high  reflectivity  back  mirror,  the  best  possible  dynamic  range  achievable  in  reflection  is  70%  with 
a contrast ratio of greater than 25:1, as shown in Fig. 4. This configuration is characterized by an
operational bandwidth of about 50 Å and an insertion loss of approximately 1.5 dB, without taking
into  account  the  system  throughput  loss  necessitated  by  the  incident  beamsplitter.  If  a  polarization- 
independent  beamsplitter  is  employed,  the  system  throughput  loss  in  this  case  is  about  8  dB  with  a 
reduced  dynamic  range  of  17.5%  at  the  same  contrast  ratio. 
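
The beamsplitter figures quoted above can be checked with a short calculation. The sketch below assumes an ideal 50/50 polarization-independent beamsplitter that is traversed twice (once on the way in and once on the way out), so that only one quarter of the light is retained. This double-pass model and all variable names are illustrative assumptions rather than details taken from the report.

```python
import math


def loss_db(transmission):
    """Convert a power transmission factor (0..1) into a loss in dB."""
    return -10.0 * math.log10(transmission)


# Values quoted in the text for the reflection modulator.
insertion_loss_db = 1.5   # modulator insertion loss
dynamic_range = 0.70      # best-case dynamic range in reflection

# Assumption: ideal 50/50 beamsplitter, traversed twice (in and out).
beamsplitter_double_pass = 0.5 * 0.5

# Total system loss and the dynamic range left after the beamsplitter.
total_loss_db = insertion_loss_db + loss_db(beamsplitter_double_pass)
reduced_dynamic_range = dynamic_range * beamsplitter_double_pass

print(f"total loss: {total_loss_db:.1f} dB")
print(f"reduced dynamic range: {100 * reduced_dynamic_range:.1f} %")
```

Under these assumptions the total loss comes out near 7.5 dB, in line with the quoted figure of about 8 dB, and the dynamic range scales from 70% to 17.5%.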

Should  a  reflection  modulator  configuration  prove  desirable,  the  transparency  afforded  by 
the  InGaAs/GaAs  MQW  system  on  a  GaAs  substrate  can  be  used  to  advantage  by  inverting  the 
traditional device structure to incorporate the low reflectivity incident mirror as a few period Bragg
mirror  (typically  AlGaAs/GaAs)  capped  during  growth  by  the  strained  layer  MQW  structure.  The 
device  may  then  be  completed  by  deposition  of  either  a  metal  (affording  both  a  contact  as  well  as 
RF  isolation)  or  dielectric  stack  high  reflectivity  mirror  on  the  top  surface.  The  principal  advantage 
of  this  structural  configuration  is  the  elimination  of  the  requirement  for  a  (typically)  20  or  more 
period  Bragg  reflector  grown  prior  to  the  precision  growth  stage  necessitated  by  the  desired 
thickness  of  active  MQW  material.  Not  only  does  this  significantly  affect  the  device  processing 
throughput  and  yield,  but  it  also  greatly  enhances  the  quality  of  the  final  interface  prior  to  the 
initiation  of  MQW  growth. 

Additional technological considerations affecting the implementation of SLMs with
InGaAs/GaAs MQW structures include issues of growth thickness accuracy (such as process
variability and array uniformity), which particularly affect two critical design parameters: namely,
the oscillator position and linewidth, and the cavity resonance condition. Background doping
concentrations are also known to be particularly important in maximizing the contrast ratio, as well
as the voltage division between any incorporated Bragg mirror (with > k_BT band discontinuities)
and the MQW active region.
Introduction

Optical  memory  disc  technology  has  recently  matured  and  penetrated  the  personal  computer 
industry  market  with  high  density  and  extremely  high  capacity  serial  data  storage  devices.  The 
high  resolution  (submicron)  and  long  life  time  (>  10  years)  characteristic  of  the  optical  disc 
medium  in  combination  with  image-based  recording  and  an  appropriate  parallel  readout  scheme 
also  make  this  advanced  technology  a  very  competitive  alternative  for  spatial  light  modulation. 

Utilization  Concept 

There  are  numerous  possible  ways  of  utilizing  the  optical  memory  disc  as  a  spatial  light 
modulator  (SLM).  One  such  configuration  is  as  a  binary  (half-tone)  memory  with  a  parallel  readout 
capacity of up to 10⁸ bits/cm². This use, of course, implies strict design constraints on the
resolution of the readout optics in order to fully utilize this capacity. More traditional SLM
functions can be incorporated by using the optical disc as a binary-encoded analog image
memory¹, in which grey scale is obtained at the expense of resolution. Assume, for simplicity, a
bit density of 10⁸ bits/cm² (which is within reach of current optical disc technology) and an image
size of 1 cm × 1 cm. The image can then be divided into 1000 × 1000 pixels, each of size 10 µm ×
10 µm with 10 × 10 bits for grey scale generation. By random area encoding the serially written
bits in each pixel, a particular grey scale level directly proportional to the total number of bits in
each pixel can be obtained. The theoretical dynamic range for this case is 100:1, corresponding to
approximately 6.6 grey scale bits, thus resulting in a total information density of ≈ 6.6 Mbits/cm².
Note that it is now only necessary to resolve each pixel (10 µm × 10 µm) in order to obtain the
desired  continuous  grey  level  in  each  pixel. 
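
The grey scale arithmetic in this example can be reproduced in a few lines. The sketch below simply restates the numbers given in the text (bit density, image size and pixel size); the variable names are illustrative.

```python
import math

# Figures stated in the text.
bit_density_per_cm2 = 10**8   # bits/cm^2
image_side_um = 10_000        # 1 cm expressed in micrometres
pixel_side_um = 10            # each pixel is 10 um x 10 um

# Linear bit density: 10^4 bits per cm, i.e. one bit per micrometre.
bits_per_cm = math.isqrt(bit_density_per_cm2)
bits_per_um = bits_per_cm // image_side_um

# Pixel count and number of bit cells available inside each pixel.
pixels_per_side = image_side_um // pixel_side_um            # 1000
bits_per_pixel = (pixel_side_um * bits_per_um) ** 2         # 10 x 10 = 100

# A pixel holding up to 100 written bits gives a 100:1 dynamic range.
grey_scale_bits = math.log2(bits_per_pixel)

# Equivalent information density over the 1 cm^2 image.
info_density_bits_per_cm2 = pixels_per_side**2 * grey_scale_bits

print(f"{pixels_per_side} x {pixels_per_side} pixels, {bits_per_pixel} bits each")
print(f"grey scale: {grey_scale_bits:.2f} bits")
print(f"information density: {info_density_bits_per_cm2 / 1e6:.2f} Mbits/cm^2")
```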

Optical  Readout  Approach 

In  order  to  implement  parallel  (two-dimensional)  readout  of  optical  memory  discs,  it  is 
desirable  to  employ  direct  imaging  of  the  optical  disc  to  obtain  the  maximum  achievable  throughput 
efficiency and at the same time obtain a high contrast ratio in the output image. An alternative
approach is to read out the optical disc in parallel holographically² by storing the images in the
form of two-dimensional computer generated holograms. However, holographic readout implies a
non-optimum trade-off between throughput efficiency and SNR for a given space-bandwidth
product. An optical readout configuration which has an inherently high SNR, is independent of
the groove profile (required for the tracking servo-system of most disc drivers), and can be utilized
with a variety of recording layer principles (such as ablation or bump forming) can be achieved by
using a differential (or shearing) interferometric approach¹. In this readout scheme, two spatially
coherent images of the optical disc are created and interfered differentially by introducing a shift of
one half of the minimum bit pitch, Λ/2, along the direction of the grooves (Fig. 1a). In the
output image, only local differences in phase (height) and/or reflectivity of the recording layer will
be detected. Thus each bit will produce two bright marks (Λ/2 apart) on a dark background in the
output image (Fig. 1b). Maximum output intensity will consequently be obtained for recording
with a 50% duty cycle pattern.
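
The behaviour described above can be illustrated with a one-dimensional toy model of a single track. The sketch below is a simplification under stated assumptions: the track is sampled on an integer grid, the two sheared images are combined with a relative phase of π and equal weight (output amplitude equal to half the difference of the two copies), and a written bit is represented either by an amplitude reflectivity of -1 (an idealized quarter-wave bump) or 0 (an idealized dark mark). The function name and parameters are hypothetical.

```python
def differential_readout(bit_len, pitch, shear, r0=1.0, r1=-1.0, n_periods=8):
    """Mean output intensity of a periodic 1-D bit pattern after shearing.

    bit_len, pitch and shear are in grid samples; r0 and r1 are the amplitude
    reflectivities of the unwritten layer and of a written bit (assumed values).
    """
    n = pitch * n_periods
    # Amplitude reflectivity along the track: r1 inside a bit, r0 elsewhere.
    track = [r1 if (x % pitch) < bit_len else r0 for x in range(n)]
    # Interfere the track with its sheared copy (pi out of phase, periodic wrap).
    intensity = [abs(track[x] - track[(x - shear) % n]) ** 2 / 4.0 for x in range(n)]
    return sum(intensity) / n


BIT_LEN = 5   # bit length in samples
SHEAR = 5     # shear equal to half of the minimum pitch (10 samples)

# Duty cycles of 50%, 25% and 12.5% correspond to pitches of 10, 20 and 40.
bump_levels = {pitch: differential_readout(BIT_LEN, pitch, SHEAR) for pitch in (10, 20, 40)}

# Idealized dark mark medium (written bit does not reflect) at 50% duty cycle.
dark_mark_level = differential_readout(BIT_LEN, 10, SHEAR, r1=0.0)

# A duty cycle above 50% (bit length 7, pitch 10) gives less output than 50%.
over_half_level = differential_readout(7, 10, SHEAR)

for pitch, level in bump_levels.items():
    print(f"duty cycle {BIT_LEN / pitch:.3f}: mean output {level:.3f}")
```

In this model the mean output is proportional to the duty cycle up to 50% and falls off beyond it, and the dark mark pattern yields one quarter of the output of the ideal bump pattern.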


Optical  Readout  Configuration 

The  differential  interferometric  readout  concept  can  be  implemented  by  using  a  shear  plate, 
which  consists  of  a  birefringent  plate  with  a  tilted  optic  axis  (Fig.  2).  When  imaging  the  optical 
disc through the shear plate, two images with orthogonal polarizations are produced and are shifted
by an appropriate amount as determined by the recorded bit length. In order to obtain the optimum
phase shift of π ± 2nπ (n is an integer) between the two images, the optical path length difference
between the two orthogonal polarizations can be set by tilting the shear plate, or pre-adjusted by a
variable compensator. The output analyzer produces the desired interference between the two
orthogonally polarized images. In practice, the shear plate tilt (and/or compensator) and the
analyzer are tuned for maximum extinction of the reflected light from an unwritten area on the disc
in order to obtain the highest possible contrast ratio in the output image.

Design  Considerations 

The  optical  readout  configuration  shown  in  Fig.  2  is  very  flexible  in  design.  The  crucial 
element,  the  shear  plate,  can  be  manufactured  in  several  ways.  For  example,  a  uniaxial  crystal  plate 
with  a  tilted  optic  axis  can  be  used  as  shown  in  Fig.  3.  The  shear,  S,  of  such  a  plate  is  given  by 

\[ S = d \, \frac{(n_e^2 - n_o^2)\,\tan\theta_c}{n_e^2 + n_o^2 \tan^2\theta_c} , \]

in which d is the width of the plate, θ_c is the optic axis tilt angle, and n_o and n_e are the ordinary and
extraordinary indices of refraction of the uniaxial crystal, respectively. Since S is to first order
directly proportional to Δn = n_e - n_o, and since the shear required (µm range) is ≪ d, it is desirable
to choose a uniaxial crystal with a small birefringence. Quartz, for example, has an index
difference Δn ≈ 0.009 (at λ = 632.8 nm), and in addition exhibits good polishing characteristics.
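
To give a feel for the plate dimensions involved, the sketch below evaluates the standard walk-off expression for a uniaxial plate, S = d (n_e² - n_o²) tan θ_c / (n_e² + n_o² tan² θ_c), for quartz. The index values (n_o ≈ 1.5426, n_e ≈ 1.5517 near 632.8 nm), the 45° tilt angle and the 5 µm target shear are assumptions chosen for illustration; the function name is hypothetical.

```python
import math


def shear(d, n_o, n_e, theta_c):
    """Lateral shear of a uniaxial plate of width d with optic axis tilt theta_c (rad)."""
    t = math.tan(theta_c)
    return d * (n_e**2 - n_o**2) * t / (n_e**2 + n_o**2 * t**2)


# Assumed handbook-style indices for crystalline quartz near 632.8 nm.
N_O = 1.5426
N_E = 1.5517

# Assumed optic axis tilt of 45 degrees.
theta_c = math.radians(45.0)

# Shear per unit plate width, and the first-order estimate (n_e - n_o) / n.
shear_per_width = shear(1.0, N_O, N_E, theta_c)
first_order = (N_E - N_O) / ((N_E + N_O) / 2.0)

# Plate width needed for a 5 um shear (the value used as an example here).
target_shear_um = 5.0
plate_width_um = target_shear_um / shear_per_width

print(f"shear per unit width: {shear_per_width:.5f} (first order: {first_order:.5f})")
print(f"plate width for {target_shear_um} um shear: {plate_width_um:.0f} um")
```

With these assumed values the shear is roughly 0.6% of the plate width, so a shear of a few micrometres corresponds to a plate somewhat under 1 mm wide. At a 45° tilt the result is the same whether the tilt angle is measured from the plate normal or from the plate surface.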

SLM  Output  Characteristics 

As  test  patterns,  three  bands  were  written  on  two  different  types  of  discs:  write-once 
ablative  media  discs  (amorphous  alloy)  of  the  dark  mark  type,  and  erasable  bump  forming  media 
(dye polymer)³. Each band was approximately 0.5 mm in width and written at a duty cycle
corresponding to a specific grey scale level. The three duty cycles used were 50% (maximum
output intensity), 25% and 12.5%, all written using 5 µm long bits. Using the differential
interferometric readout configuration with a 5 µm shear, the three bands for both media were
clearly extracted with the appropriate grey scale level factor of two between adjacent bands. The
measured output intensity normalized to the highest grey scale level (the 50% duty cycle band) as a
function of duty cycle for both media is shown in Fig. 4. The SLM output characteristic is seen to
be very linear for both media. However, the currently obtained contrast ratio is much higher for
the bump forming medium (≈ 20:1) than for the ablative medium (≈ 5:1). The main reason for the
lower contrast ratio for the ablative medium is interference from the other (transmissive)
recording layer on the two-sided optical disc. The ablative recording layer is furthermore read out
through a non-AR coated polycarbonate substrate (≈ 1.2 mm thick) that may exhibit some residual
birefringence.  The  bump  forming  medium,  on  the  other  hand,  has  a  non-transmissive  air  incident 
recording  layer.  The  contrast  ratio  of  the  bump  forming  medium  could  be  further  increased  by 
using  high  quality  (glass)  substrates  and  higher  uniformity  recording  layers  with  fewer 
(depolarizing)  surface  blemishes. 


Differential  Interferometric  Readout  Performance 

The  maximum  throughput  efficiency  for  an  optical  disc  medium  in  an  otherwise  ideal 
differential  interferometric  readout  configuration  is  given  by 


\[ \eta_{medium} = \frac{1}{4} \left[ \, |r_0|^2 + |r_1|^2 - 2 \, |r_0| \, |r_1| \cos\left( \frac{4 \pi n h}{\lambda} + \arg\{r_0\} - \arg\{r_1\} \right) \right] , \]


in which r_0 and r_1 are the (complex valued) amplitude reflectivities of the (unwritten) recording
layer and the written bit respectively, h is the height of the written bit and n is the refractive index
of the optical disc substrate (for substrate incident media). The highest obtainable efficiency is
achieved for an (ideal) bump forming medium (shown in Fig. 5) for which r_0 = r_1 and h =
±λ/4n. That is, the efficiency is directly proportional to |r_0|² with a unity constant (the efficiency
factor is unity). However, for an ideal dark mark ablative medium (r_1 = 0 and h = 0), the
efficiency is directly proportional to |r_0|²/4; that is, the efficiency factor is 1/4. Most ablative media
in fact comprise amorphous alloy recording layers with complex valued indices of refraction
characterized by a non zero bit reflectivity (r_1), which may in fact boost the effective efficiency
factor beyond 1/4. For our experimental readout configuration, the total measured throughput
efficiency (relative to the collimated laser beam) of the SLM for the two media was 0.9% for the
bump forming medium and 3.0% for the ablative medium³. These measurements however
included  the  losses  in  the  readout  optics  (polarizing  beam  splitter,  shear  plate  and  imaging  lenses) 
for  which  the  throughput  efficiency  was  independently  measured  to  be  0.85.  The  remaining  losses 
can  be  divided  into  two  separate  parameters:  losses  due  to  the  optical  disc  medium  (limited 
recording  layer  reflectivity,  air-substrate  interface  reflection  loss  and  diffraction  from  grooved 
substrates)  and  loss  due  to  the  data  pattern  itself,  which  involves  the  shape  and  size  of  the  written 
bits  (efficiency  factor,  fill  factor  of  bits,  and  diffraction  losses  due  to  limited  numerical  aperture). 
The  optical  disc  medium  losses  are  mainly  determined  by  the  reflectivity  of  the  recording  layer, 
which  was  40%  for  the  ablative  medium  and  13%  for  the  bump  forming  medium.  In  another 
experiment,  the  bump  forming  medium  was  also  coated  with  a  30  nm  aluminum  layer,  which 
boosted  the  reflectivity  to  86%  and  the  total  throughput  efficiency  to  5.8%.  The  total  data  pattern 
efficiencies  were  approximately  the  same  for  both  media  (0.11  for  the  ablative  and  0.08  for  the 
bump  forming).  The  relatively  low  data  pattern  efficiency  for  the  bump  forming  medium  is  due  to 
the non-ideal shape of the bumps and a slightly lower fill factor of ≈ 0.40 versus 0.63 for the
ablative  medium.  By  increasing  the  fill  factor  and  the  numerical  aperture  of  the  readout  optics,  and 
by  optimizing  the  AR-coatings,  the  (complex  valued)  recording  layer  reflectivity  (e.g.  by  using 
separate  write  and  parallel  read  wavelengths)  and  the  bump  shape  (if  applicable),  the  total 
throughput  efficiency  can  be  substantially  increased. 
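
The limiting cases and the measured efficiencies quoted above can be cross-checked numerically. The sketch below makes two assumptions that should be read as such: first, that the medium efficiency has the form (1/4)[|r_0|² + |r_1|² - 2|r_0||r_1| cos(4πnh/λ + arg r_0 - arg r_1)], which reproduces the efficiency factors of unity and 1/4 stated in the text; second, that the total throughput can be approximated as the simple product of the readout optics efficiency, the recording layer reflectivity and the data pattern efficiency. Function names are hypothetical.

```python
import cmath
import math


def medium_efficiency(r0, r1, h_over_lambda=0.0, n=1.0):
    """Medium efficiency for amplitude reflectivities r0, r1 and bit height h (assumed form)."""
    phase = 4.0 * math.pi * n * h_over_lambda + cmath.phase(r0) - cmath.phase(r1)
    return 0.25 * (abs(r0) ** 2 + abs(r1) ** 2 - 2.0 * abs(r0) * abs(r1) * math.cos(phase))


def total_throughput(optics, reflectivity, data_pattern):
    """Assumed product model for the total throughput efficiency."""
    return optics * reflectivity * data_pattern


# Limiting cases with |r0| = 1: ideal bump (r1 = r0, h = lambda/4n) and ideal dark mark.
n_substrate = 1.5   # illustrative value; the result does not depend on it here
ideal_bump = medium_efficiency(1.0, 1.0, h_over_lambda=1.0 / (4.0 * n_substrate), n=n_substrate)
ideal_dark_mark = medium_efficiency(1.0, 0.0)

# Measured quantities quoted in the text.
OPTICS = 0.85
bump = total_throughput(OPTICS, 0.13, 0.08)
bump_aluminized = total_throughput(OPTICS, 0.86, 0.08)
ablative = total_throughput(OPTICS, 0.40, 0.11)

print(f"efficiency factors: bump {ideal_bump:.2f}, dark mark {ideal_dark_mark:.2f}")
print(f"bump forming: {100 * bump:.1f} %, with aluminum: {100 * bump_aluminized:.1f} %")
print(f"ablative (product model): {100 * ablative:.1f} %")
```

The product model reproduces the 0.9% and 5.8% figures for the bump forming medium. For the ablative medium it gives a somewhat higher value than the measured 3.0%, which is plausibly accounted for by the medium losses the model leaves out (air-substrate interface reflection and diffraction from the grooved substrate).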
Figure 2. Optical readout configuration.

Figure 3. Shear plate.

Figure 4a. SLM output characteristic.

Figure 4b. SLM output characteristic.

Figure 5. Readout geometry of a bump forming optical disc medium.


